AI RESEARCH
STAMP: Training Explicit Memory for Mobile GUI Agents in Controllable and Scalable Virtual Environments
arXiv CS.CV
•
ArXi:2605.29324v1 Announce Type: cross Mobile GUI agents excel at immediate reactive control but frequently fail in realistic, long-horizon tasks that require memory. This failure stems from a fundamental conflict between limited context windows and token-heavy screenshots. To save the limited context, agents must progressively discard older visual history, permanently losing crucial transient information.