AI RESEARCH

Weak Convergence Analysis of Online Neural Actor-Critic Algorithms

arXiv CS.LG

ArXi:2403.16825v2 Announce Type: replace We prove that a single-layer neural network trained with the online actor critic algorithm converges in distribution to a random ordinary differential equation (ODE) as the number of hidden units and the number of