AI RESEARCH
Evaluating and Calibrating LLM Confidence on Questions with Multiple Correct Answers
arXiv cs.CL
arXiv:2602.07842v2 Announce Type: replace Confidence calibration is essential for making large language models (LLMs) reliable, yet existing