AI RESEARCH
StakeBench: Evaluating Language Understanding Grounded in Market Commitment
arXiv CS.AI
arXiv:2605.26074v1 Announce Type: cross Existing financial NLP benchmarks often rely on labels supplied by outside observers, measuring how language is perceived rather than what speakers have committed to in the market. We