AI RESEARCH

Not All Flips Are Conformity: Decomposing Stance Convergence in Multi-Agent LLM Debate

arXiv CS.CL

arXiv:2606.00820v1 Announce Type: new

Multi-agent debate (MAD) is a promising strategy for improving LLM reasoning, but when agents converge on a shared answer, it is unclear whether that convergence reflects genuine deliberation or social compliance. We show that the conventional answer flip rate conflates three distinct mechanisms: spontaneous instability, stance-induced conformity, and reasoning-induced persuasion. Our three-source decomposition framework isolates each through controlled counterfactual conditions.
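To make the idea of splitting a flip rate into three sources concrete, here is a minimal sketch. It is not the paper's method: the abstract does not specify the counterfactual conditions or the arithmetic, so the three conditions used here (`no_peer`, `stance_only`, `full`), the function names, and the simple additive subtraction are all assumptions chosen for illustration.

```python
from typing import Dict, Sequence


def flip_rate(before: Sequence[str], after: Sequence[str]) -> float:
    """Fraction of items whose answer differs between two rounds."""
    if len(before) != len(after) or not before:
        raise ValueError("need two non-empty, equal-length answer lists")
    return sum(b != a for b, a in zip(before, after)) / len(before)


def decompose_flips(
    initial: Sequence[str],
    no_peer: Sequence[str],      # hypothetical: agent re-answers with no peer input
    stance_only: Sequence[str],  # hypothetical: agent sees peers' answers only
    full: Sequence[str],         # hypothetical: agent sees peers' answers and reasoning
) -> Dict[str, float]:
    """Assumed additive split of the full-debate flip rate into three sources."""
    spontaneous = flip_rate(initial, no_peer)
    with_stance = flip_rate(initial, stance_only)
    total = flip_rate(initial, full)
    return {
        "spontaneous": spontaneous,               # flips with no social signal at all
        "conformity": with_stance - spontaneous,  # extra flips from seeing stances
        "persuasion": total - with_stance,        # extra flips from seeing reasoning
        "total": total,
    }


# Toy answers for ten questions (made-up data, for illustration only).
initial     = ["A", "A", "B", "B", "A", "B", "A", "A", "B", "A"]
no_peer     = ["B", "A", "B", "B", "A", "B", "A", "A", "B", "A"]
stance_only = ["B", "B", "A", "B", "A", "B", "A", "A", "B", "A"]
full        = ["B", "B", "A", "A", "B", "B", "A", "A", "B", "A"]

parts = decompose_flips(initial, no_peer, stance_only, full)
for name, value in parts.items():
    print(f"{name}: {value:.2f}")
```

Under this assumed scheme the three components sum to the total flip rate by construction. A component can be negative on real data, and the sketch does not separate flips toward a correct answer from flips away from one.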