Convergence Without Understanding: When Language Models Agree on Representations but Disagree on Reasoning

ArXi:2605.23315v1 Announce Type: cross Large language models trained under diverse objectives and architectures have been shown to develop increasingly similar internal representations, an observation formalized as the Platonic Representation Hypothesis. Whether this representational convergence extends to the reasoning processes that operate over shared representations remains untested.