AI RESEARCH

Chess-World-Model: A 10M-Game Benchmark for Exact State Tracking from Chess Move Sequences

arXiv CS.LG

ArXi:2605.30100v1 Announce Type: new World models require state tracking, which is the ability to maintain a correct latent state across action sequences. Existing benchmarks are often synthetic or language-based, limiting their value as tests of structured state updates in realistic domains. We