AI RESEARCH
Yes, Q-learning Helps Offline In-Context RL
arXiv CS.AI
•
ArXi:2502.17666v4 Announce Type: replace-cross Existing offline in-context reinforcement learning (ICRL) methods have predominantly relied on supervised