AI RESEARCH

Distribution-Aware Reward: Reinforcement Learning over Predictive Distributions for LLM Regression

arXiv CS.LG

ArXi:2605.20740v1 Announce Type: new Large language models can predict real-valued quantities from heterogeneous inputs such as text, code, and molecular strings, but most