SparseOpt: Addressing Normalization-induced Gradient Skew in Sparse Training
arXiv cs.LG
arXiv:2605.27541v1 Announce Type: new