AI RESEARCH

Reasoning-preserved Efficient Distillation of Large Language Models via Activation-aware Initialization

arXiv CS.LG

ArXi:2605.29327v1 Announce Type: cross Efficient Distillation (EDistill) compresses large language models (LLMs) by structured pruning parameters and tuning lightweight modules with high