Class: LLMJudge
Defined in: packages/agentos/src/core/evaluation/LLMJudge.ts:127
LLM-based judge for semantic evaluation
Constructors
Constructor
new LLMJudge(
config):LLMJudge
Defined in: packages/agentos/src/core/evaluation/LLMJudge.ts:134
Parameters
config
Returns
LLMJudge
Methods
batchJudge()
batchJudge(
evaluations,criteria?,concurrency?):Promise<JudgmentResult[]>
Defined in: packages/agentos/src/core/evaluation/LLMJudge.ts:263
Batch evaluate multiple outputs
Parameters
evaluations
object[]
criteria?
concurrency?
number = 3
Returns
Promise<JudgmentResult[]>
compare()
compare(
input,outputA,outputB,criteria?):Promise<{reasoning:string;scoreA:number;scoreB:number;winner:"A"|"B"|"tie"; }>
Defined in: packages/agentos/src/core/evaluation/LLMJudge.ts:229
Compare two outputs and determine which is better
Parameters
input
string
outputA
string
outputB
string
criteria?
Returns
Promise<{ reasoning: string; scoreA: number; scoreB: number; winner: "A" | "B" | "tie"; }>
createScorer()
createScorer(
criteria?):ScorerFunction
Defined in: packages/agentos/src/core/evaluation/LLMJudge.ts:218
Create a scorer function for use with Evaluator
Parameters
criteria?
Returns
judge()
judge(
input,actualOutput,expectedOutput?,criteria?):Promise<JudgmentResult>
Defined in: packages/agentos/src/core/evaluation/LLMJudge.ts:145
Judge an AI output against criteria
Parameters
input
string
actualOutput
string
expectedOutput?
string
criteria?
Returns
Promise<JudgmentResult>