AutoMedBench: Towards Medical AutoResearch with Agentic AI Models

ArXi:2606.01961v1 Announce Type: new Autonomous agents are increasingly expected to end-to-end medical-AI research workflows, moving beyond isolated prediction tasks or short-form clinical question answering. However, existing medical agent benchmarks primarily evaluate final outputs, providing limited visibility into agent behavior within the research process.