AI RESEARCH
UltraEP: Unleash MoE Training and Inference on Rack-Scale Nodes with Near-Optimal Load Balancing
arXiv CS.LG
•
ArXi:2606.04101v1 Announce Type: cross Large-scale expert parallelism (EP) is becoming pivotal for