AI RESEARCH
LC-ERD: Mining Latent Logic for Self-Evolving Reasoning via Consistency-Regulated Reward Decomposition
arXiv CS.AI
•
ArXi:2605.24005v1 Announce Type: new The evolution of Large Language Model (LLM) reasoning is bottlenecked by the scarcity of high-quality process data.