BeyondSWE: Can Current Code Agent Survive Beyond Single-Repo Bug Fixing?

ArXi:2603.03194v2 Announce Type: replace Current code-agent benchmarks primarily evaluate localized issue resolution within a single target repository, leaving under-tested many software engineering tasks that require external knowledge or broader repository-level changes. We