AI RESEARCH

Efficient Post-training of LLMs for Code Generation With Offline Reinforcement Learning

arXiv CS.AI

Post-