AI RESEARCH
Inferring the Size of Large Language Models From Popular Text Memorization
arXiv CS.LG
•
ArXi:2605.29223v1 Announce Type: new The parameter counts of the most widely used large language models (LLMs) are often withheld by their developers, leaving model size -- a primary reference point for interpreting capabilities and costs -- largely undisclosed. We propose a black-box method to infer conservative lower bounds on LLM size from generated text outputs alone, requiring nothing beyond the ability to submit text fragments and observe next-token predictions.