AI RESEARCH
VISTA: An End-to-End Benchmark for Visual Spec-to-Web-App Coding Agents
arXiv CS.AI
•
ArXi:2605.26144v1 Announce Type: cross We present VISTA (VIsual Spec-To-App Benchmark), a benchmark for evaluating the end-to-end web-app generation capabilities of LLM-based agents. Unlike prior code generation benchmarks that focus on algorithmic tasks, VISTA targets realistic UI-centric development, where agents must produce functional, visually coherent applications from underspecified inputs.