AI RESEARCH

StakeBench: Evaluating Language Understanding Grounded in Market Commitment

arXiv CS.AI

ArXi:2605.26074v1 Announce Type: cross Existing financial NLP benchmarks often rely on labels supplied by outside observers, measuring how language is perceived rather than what speakers have committed to in the market. We