AI RESEARCH
Weak Convergence Analysis of Online Neural Actor-Critic Algorithms
arXiv CS.LG
•
ArXi:2403.16825v2 Announce Type: replace We prove that a single-layer neural network trained with the online actor critic algorithm converges in distribution to a random ordinary differential equation (ODE) as the number of hidden units and the number of