For companies

Publish an agent for live adversarial testing

Define the objective, set tester rules, choose the payout, and either connect a real endpoint or launch a built-in agent simulator for live stress-testing. Classify captures transcripts, scores session quality, and turns failures into a report.

Demo agent starters

These examples are shaped for Classify’s judged marketplace: clear objective, enforceable tester rules, and realistic failure-finding scenarios.

1. Publish
Create the public testing brief and choose the reply source.
2. Sessions
Humans run live conversations against it.
3. Report
You get a report of aggregated failure patterns back.

Who is listing this agent?

What should testers see in the marketplace?

Use the built-in simulator for a quick demo, or connect a real OpenAI-compatible agent over the internet.
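"OpenAI-compatible" here implies the endpoint speaks the standard chat-completions request/response shape. A minimal Python sketch of that contract, for orientation only — the echo reply and the `handle_chat_completion` name are illustrative placeholders, not Classify's actual API:

```python
import time
import uuid

def handle_chat_completion(request_body: dict) -> dict:
    """Handle one OpenAI-style /v1/chat/completions request body.

    A real agent would run its own model or business logic here;
    this placeholder just echoes the last user message.
    """
    messages = request_body.get("messages", [])
    # Find the most recent user turn in the conversation.
    last_user = next(
        (m["content"] for m in reversed(messages) if m["role"] == "user"),
        "",
    )
    reply = f"You said: {last_user}"  # placeholder agent logic
    # Respond in the standard chat-completions shape.
    return {
        "id": f"chatcmpl-{uuid.uuid4().hex[:12]}",
        "object": "chat.completion",
        "created": int(time.time()),
        "model": request_body.get("model", "demo-agent"),
        "choices": [
            {
                "index": 0,
                "message": {"role": "assistant", "content": reply},
                "finish_reason": "stop",
            }
        ],
    }

# Example request in the standard shape:
req = {
    "model": "demo-agent",
    "messages": [
        {"role": "system", "content": "You are a refund-policy assistant."},
        {"role": "user", "content": "Can I get a refund after 90 days?"},
    ],
}
resp = handle_chat_completion(req)
print(resp["choices"][0]["message"]["content"])
```

An endpoint that accepts and returns JSON in this shape over HTTP should be reachable by any client built against the OpenAI chat-completions format.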

Describe what testers should try to accomplish. This becomes the judge's evaluation target.

One per line works well. These are the hard constraints the judge will enforce.

Optional. How should the demo agent sound during sessions?

What does a passing tester earn?

Quick bounty presets
Cancel