AI RESEARCH

UltraEP: Unleash MoE Training and Inference on Rack-Scale Nodes with Near-Optimal Load Balancing

arXiv CS.LG

ArXi:2606.04101v1 Announce Type: cross Large-scale expert parallelism (EP) is becoming pivotal for