AI RESEARCH
Distribution-Aware Reward: Reinforcement Learning over Predictive Distributions for LLM Regression
arXiv CS.LG
•
ArXi:2605.20740v1 Announce Type: new Large language models can predict real-valued quantities from heterogeneous inputs such as text, code, and molecular strings, but most