From Layers to Submodules: Rethinking Granularity in Replacement-Based LLM Compression

arXiv cs.AI

Post-