Understanding Data Temporality Impact on Large Language Models Pre-training

arXiv:2605.22769v1 Announce Type: new Large language models (LLMs) are typically trained on shuffled corpora, yielding models whose knowledge is frozen at train time and whose temporal grounding remains poorly understood. In this work, we study the impact of pre-