wildchat_judge
WildChat scorer — dual-comparison LLM-as-judge.
Adapted from IPW's wildchat.py evaluation handler.
Classes
WildChatScorer
WildChatScorer(judge_backend: InferenceBackend, judge_model: str)
Bases: LLMJudgeScorer
Dual-comparison LLM-as-judge for chat quality.
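The page does not spell out what "dual-comparison" involves. One common reading is that the judge is asked twice with the two answers in swapped positions, so that a preference for whichever answer appears first cancels out. The sketch below illustrates that idea only, under that assumption. `dual_compare`, the `Judge` callable type, and `longer_answer_judge` are hypothetical names introduced for illustration; they are not part of `WildChatScorer`, `LLMJudgeScorer`, or `InferenceBackend`.

```python
from typing import Callable

# Hypothetical judge signature (an assumption, not the library's API):
# (prompt, answer_a, answer_b) -> "A", "B", or "tie".
Judge = Callable[[str, str, str], str]


def dual_compare(judge: Judge, prompt: str, candidate: str, reference: str) -> float:
    """Score a candidate against a reference using two order-swapped judge calls.

    Returns 1.0 if the candidate wins both orderings, 0.0 if it loses both,
    and 0.5 otherwise (a tie, or the two orderings disagree).
    """
    first = judge(prompt, candidate, reference)   # candidate shown as "A"
    second = judge(prompt, reference, candidate)  # candidate shown as "B"
    candidate_wins = (first == "A") + (second == "B")
    reference_wins = (first == "B") + (second == "A")
    if candidate_wins == 2:
        return 1.0
    if reference_wins == 2:
        return 0.0
    return 0.5


def longer_answer_judge(prompt: str, answer_a: str, answer_b: str) -> str:
    """Stand-in judge for illustration only: prefers the longer answer."""
    if len(answer_a) == len(answer_b):
        return "tie"
    return "A" if len(answer_a) > len(answer_b) else "B"
```

In a real setup the stand-in judge would be replaced by a call to the judge model. The point of the two calls is that a judge which always picks the first position ends up at 0.5 rather than producing a spurious win.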