AI RESEARCH
In-Expectation Convergence of Stochastic Gradient Methods under Heavy-Tailed Noise
arXiv CS.LG
•
ArXi:2606.00520v1 Announce Type: cross Many stochastic gradient methods are believed not to converge when the noise in stochastic gradients has only a finite $p$-th moment for $p\in\left(1,2\right)$, a setting known as the heavy-tailed noise assumption. However, some recent studies have found that Stochastic Gradient Descent ($\textsf{SGD}$), without any modification to its update rule, can surprisingly converge in expectation for convex problems with bounded domains, highlighting the potential of classical stochastic gradient methods.