Skip to main content

Streams an evaluator model's analysis comparing two previously generated A/B responses.

POST /studio/compare_results

Streams the evaluator model's analysis (as SSE) against the original prompt, then persists the structured verdict onto the parent OrcaRequest.

Validates the request id and loads the associated OrcaRequest with its tasks (A/B).
Constructs a system instruction plus user messages containing the original prompt and both responses.
Streams the evaluator's analysis with SSE headers; after streaming, persists the full analysis to the request.
Responds with appropriate HTTP codes for missing/invalid request or if not found.

Request

Responses

200
401
403

OK