AI RESEARCH
BeyondSWE: Can Current Code Agent Survive Beyond Single-Repo Bug Fixing?
arXiv CS.CL
•
ArXi:2603.03194v2 Announce Type: replace Current code-agent benchmarks primarily evaluate localized issue resolution within a single target repository, leaving under-tested many software engineering tasks that require external knowledge or broader repository-level changes. We