AI RESEARCH
From Layers to Submodules: Rethinking Granularity in Replacement-Based LLM Compression
arXiv CS.AI
Post-