AI RESEARCH
WorkstreamBench: Evaluating LLM Agents on End-to-End Spreadsheet Tasks in Finance
arXiv CS.AI
•
ArXi:2605.22664v1 Announce Type: new LLM agents are increasingly expected to carry out end-to-end workflows, producing complete artifacts from high-level user instructions. To meet enterprise needs, frontier AI labs have developed agents that can construct entire spreadsheets from scratch. This is especially relevant in finance, where core workflows such as financial modeling, forecasting, and scenario analysis are commonly conducted through spreadsheets.