AI RESEARCH
The Geometry of Grokking: Norm Minimization on the Zero-Loss Manifold
arXiv CS.AI
•
ArXi:2511.01938v3 Announce Type: replace-cross Grokking is a puzzling phenomenon in neural networks where full generalization occurs only after a substantial delay following the complete memorization of the