AI RESEARCH

BigFinanceBench: A Workflow-Grounded Benchmark for Financial-Research Agents

arXiv CS.AI

ArXi:2606.03829v1 Announce Type: new Financial-research answers are decision-relevant only when another analyst can audit how they were produced: which source was chosen, which period and accounting definition were used, which assumptions were made, and how the calculation was performed. Existing finance benchmarks largely evaluate isolated subskills or final answers, leaving the auditable derivation itself under-measured. We