AI RESEARCH

The Geometry of Grokking: Norm Minimization on the Zero-Loss Manifold

arXiv CS.AI

ArXi:2511.01938v3 Announce Type: replace-cross Grokking is a puzzling phenomenon in neural networks where full generalization occurs only after a substantial delay following the complete memorization of the