Model ID used for the single-pass creation review. Should be a fast, cost-efficient model since correctness is primarily validated through test cases.
"gpt-4o-mini"
Model ID used by both reviewers in the promotion panel.
Should be a more capable model than judgeModel since promotion
decisions are higher-stakes.
"gpt-4o"
Callback that invokes an LLM to generate text from a prompt. The judge calls this for creation reviews and promotion panels.
The model ID to use for generation.
The full prompt string to send to the LLM.
The raw text response from the LLM.
Configuration for the EmergentJudge.
All LLM interaction is abstracted behind the
generateTextcallback, making the judge model-agnostic and easily testable with mocks.