AI RESEARCH

ClinEnv: An Interactive Multi-Stage Long Horizon EHR Environment for Agents

arXiv CS.AI

ArXi:2606.02568v1 Announce Type: new Clinical practice is not the selection of an answer from enumerated options: a physician gathers heterogeneous information incrementally and commits to sequential, irreversible decisions under uncertainty. Static benchmarks cannot probe and existing interactive medical benchmarks each compromise on at least one of them. We present ClinEn, an interactive benchmark that evaluates LLMs as attending physicians over real inpatient admissions under a paradigm we term Longitudinal Inpatient Simulation.