liveresearchbench
liveresearchbench
¶
LiveResearchBench (Salesforce) scorer — checklist-based evaluation.
Evaluates research reports using per-question checklists for coverage, plus LLM-as-judge for presentation quality and citation adequacy.
Reference: https://github.com/SalesforceAIResearch/LiveResearchBench
Classes¶
LiveResearchBenchSFScorer
¶
LiveResearchBenchSFScorer(judge_backend: InferenceBackend, judge_model: str)
Bases: LLMJudgeScorer
Checklist + quality scorer for Salesforce LiveResearchBench.