liveresearch
liveresearch
¶
LiveResearchBench dataset provider — deep research benchmark.
Clones the deep_research_bench repo at runtime and parses query + criteria JSONL files into EvalRecords for use with AgenticRunner.
Reference: https://github.com/Ayanami0730/deep_research_bench Paper: https://arxiv.org/abs/2510.14240
Classes¶
LiveResearchBenchDataset
¶
Bases: DatasetProvider
LiveResearchBench — deep research with 100 expert-curated tasks.
Clones Ayanami0730/deep_research_bench from GitHub (or uses a local path) and parses query + criteria JSONL files into EvalRecords.