AI RESEARCH

Proof-Refactor: Refactoring Generated Formal Proofs into Modular Artifacts

arXiv CS.AI

ArXi:2606.03743v1 Announce Type: new While Large Language Models (LLMs) have shown strong performance in generating formal proofs, their outputs often remain less readable, modular, maintainable, and reusable than proofs in mature formal mathematics libraries. We argue that this gap stems in part from the compile-first objective implicit in most proof-generation pipelines, which encourages monolithic or ad hoc proof scripts rather than library-quality artifacts. Existing approaches to proof-quality improvement often rely on explicit, computable optimization objectives.