AI RESEARCH

Evaluating and Calibrating LLM Confidence on Questions with Multiple Correct Answers

arXiv CS.CL • June 03, 2026

ArXi:2602.07842v2 Announce Type: replace Confidence calibration is essential for making large language models (LLMs) reliable, yet existing