AI RESEARCH

Verifiable Benchmarking of Long-Horizon Spatial Biology

arXiv CS.AI

ArXi:2605.28065v1 Announce Type: new AI agents are increasingly useful for biological data analysis, but existing benchmarks mostly test broad biological knowledge, executable workflows, or localized analysis steps rather than end-to-end scientific reasoning over spatial measurements. We