Confidence-Orchestrated Self-Evolution against Uncertain LLM Feedback

arXiv:2605.28010v1 Announce Type: new Self-evolving large language models (LLMs) generate their own tasks and solutions, reducing reliance on human-curated supervision. However, in many reasoning domains, the model must also validate generated tasks and judge generated answers to obtain