AI RESEARCH
Understanding Data Temporality Impact on Large Language Models Pre-training
arXiv CS.CL
•
ArXi:2605.22769v1 Announce Type: new Large language models (LLMs) are typically trained on shuffled corpora, yielding models whose knowledge is frozen at train time and whose temporal grounding remains poorly understood. In this work, we study the impact of pre-