Expert-Aware Causal Tracing of Factual Recall in Sparse MoE Language Models

ArXi:2606.03780v1 Announce Type: cross Causal tracing of factual recall has been studied predominantly in dense transformer language models, where interventions localize information flow to layers or feed-forward modules. Sparse mixture-of-experts (MoE) language models