AI RESEARCH
Confidence-Orchestrated Self-Evolution against Uncertain LLM Feedback
arXiv CS.AI
•
ArXi:2605.28010v1 Announce Type: new Self-evolving large language models (LLMs) tasks and solutions, reducing reliance on human-curated supervision. However, in many reasoning domains, the model must also validate generated tasks and judge generated answers to obtain