AI RESEARCH

SparseOpt: Addressing Normalization-induced Gradient Skew in Sparse Training

arXiv cs.LG

arXiv:2605.27541v1 Announce Type: new