ACE: Self-Evolving LLM Coding Framework via Adversarial Unit Test Generation and Preference Optimization

ArXi:2605.16299v2 Announce Type: replace-cross Large Language Models (LLMs) excel at code generation but remain heavily reliant on large-scale annotated solutions and verification-based supervision, which constrains scalability and hinders sustained self-improvement. Recent solver--verifier frameworks exploit program execution as an automatic supervision signal, but their effectiveness degrades as solvers become moderately strong: verifier-generated tests increasingly confirm semantic correctness rather than exposing the remaining failure modes.