AI RESEARCH

Do Models Know Why They Changed Their Mind? Interpretability and Faithfulness of Chain-of-Thought Under Knowledge Conflict

arXiv CS.AI • May 28, 2026

arXiv:2605.27773v1 Announce Type: cross When a language model sees a document contradicting its