AI RESEARCH

Exploiting weight-space symmetries for approximating curvature

arXiv CS.LG

ArXi:2606.00442v1 Announce Type: new Many machine learning techniques rely on approximating a loss function's curvature, but this is notoriously hard to do at the scale of modern deep networks. Surprisingly, no previous work has exploited the curvature constraints that arise from well known weight-space symmetries in loss landscapes. By analytically averaging over group actions that leave the loss invariant, we construct structured Hessian approximations from single gradients that can be tractably estimated, d, and inverted.