AI RESEARCH

LC-ERD: Mining Latent Logic for Self-Evolving Reasoning via Consistency-Regulated Reward Decomposition

arXiv CS.AI

ArXi:2605.24005v1 Announce Type: new The evolution of Large Language Model (LLM) reasoning is bottlenecked by the scarcity of high-quality process data.