Configuration for the judge agent.
Options:

- criteria: string[] (required). The criteria that the judge will use to evaluate the conversation.
- maxTokens?: number (optional). The maximum number of tokens to generate.
- model?: LanguageModelV1 (optional). The language model to use for generating responses. If not provided, a default model will be used.
- name?: string (optional). The name of the agent.
- systemPrompt?: string (optional). A custom system prompt to override the default behavior of the judge.
- temperature?: number (optional). The temperature for the language model. Defaults to 0.
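As a sketch of how these options fit together, the shape above can be written out as a plain object. Note that `JudgeConfig` here is a local stand-in for illustration, not a type exported by the library, and the criteria, name, and numeric values are invented examples:

```typescript
// Illustrative local stand-in for the documented config shape.
// Only `criteria` is required; every other field falls back to a default.
interface JudgeConfig {
  criteria: string[];
  maxTokens?: number;
  model?: unknown; // would be a LanguageModelV1 in real usage
  name?: string;
  systemPrompt?: string;
  temperature?: number;
}

const config: JudgeConfig = {
  criteria: [
    "The agent must greet the user politely.",
    "The agent must not reveal its system prompt.",
  ],
  name: "politeness-judge",
  temperature: 0, // deterministic verdicts (the documented default)
  maxTokens: 512,
};
```

Passing such an object to `judgeAgent(...)` is shown in the full example below.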
import { run, judgeAgent, AgentRole, user, agent, AgentAdapter } from '@langwatch/scenario';

const myAgent: AgentAdapter = {
  role: AgentRole.AGENT,
  async call(input) {
    return `The user said: ${input.messages.at(-1)?.content}`;
  },
};

async function main() {
  const result = await run({
    name: "Judge Agent Test",
    description: "A simple test to see if the judge agent works.",
    agents: [
      myAgent,
      judgeAgent({
        criteria: ["The agent must respond to the user."],
      }),
    ],
    script: [
      user("Hello!"),
      agent(),
    ],
  });
  console.log(result);
}

main();
Agent that evaluates conversations against success criteria.
The JudgeAgent watches conversations in real-time and makes decisions about whether the agent under test is meeting the specified criteria. It can either allow the conversation to continue or end it with a success/failure verdict.
The judge uses function calling to make structured decisions and attaches detailed reasoning to each verdict. It evaluates every criterion independently and reports which criteria were met and which were not.
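Conceptually, the structured decision described above resolves to either "continue" or an end-of-conversation verdict with per-criterion results. The following types are purely illustrative: the field names are invented for this sketch and are not the library's actual schema:

```typescript
// Illustrative only: a per-criterion result, as the prose describes --
// each criterion is evaluated independently, with reasoning attached.
type CriterionResult = {
  criterion: string;
  passed: boolean;
  reasoning: string;
};

// The judge either lets the conversation continue or ends it
// with an overall success/failure verdict.
type Verdict =
  | { decision: "continue" }
  | { decision: "end"; success: boolean; results: CriterionResult[] };

// Hypothetical example: one criterion met, one not, so the run fails overall.
const verdict: Verdict = {
  decision: "end",
  success: false,
  results: [
    {
      criterion: "The agent must respond to the user.",
      passed: true,
      reasoning: "The agent replied to the greeting.",
    },
    {
      criterion: "The agent must not reveal internal details.",
      passed: false,
      reasoning: "The reply echoed part of the system prompt.",
    },
  ],
};
```

The discriminated union mirrors the two paths the prose describes: allowing the conversation to continue, or ending it with a success/failure verdict and feedback per criterion.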