AI RESEARCH

Expert Merging in Sparse Mixture of Experts with Nash Bargaining

arXiv CS.LG

ArXi:2510.16138v2 Announce Type: replace Existing expert merging strategies for Sparse Mixture of Experts (SMoE) typically rely on input-dependent or input-independent averaging of expert parameters, but often lack a principled weighting mechanism. In this work, we reinterpret expert merging through the lens of game theory, revealing cooperative and competitive dynamics among experts. Based on this perspective, we