AI RESEARCH

STAMP: Training Explicit Memory for Mobile GUI Agents in Controllable and Scalable Virtual Environments

arXiv CS.CV

ArXi:2605.29324v1 Announce Type: cross Mobile GUI agents excel at immediate reactive control but frequently fail in realistic, long-horizon tasks that require memory. This failure stems from a fundamental conflict between limited context windows and token-heavy screenshots. To save the limited context, agents must progressively discard older visual history, permanently losing crucial transient information.