AI RESEARCH
Efficient Post-training of LLMs for Code Generation With Offline Reinforcement Learning
arXiv CS.AI
•
Post-
Post-