Microsoft Unveils MAI-Thinking-1: 35B Active, 1T Parameters, 97% on AIME 2025
Dev.to Machine Learning
About This Tutorial
Microsoft has unveiled MAI-Thinking-1, a mixture-of-experts (MoE) reasoning model with 35B active parameters out of 1T total, trained on 30T human tokens without distillation. It scores 97% on AIME 2025. The model is the first output of what Microsoft calls a 'hill-climbing machine': a closed-loop pipeline for iteratively improving reasoning models.

Key facts
- MAI-Thinking-1: 35B active, 1T total MoE parameters.
- 97.0% on the AIME 2025 math benchmark.
- 87.7% on the LiveCodeBench v6 coding benchmark.
- 52.8% on the SWE-Bench Pro software engineering benchmark.
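As a rough illustration of what the "35B active, 1T total" figures imply, the sketch below computes the share of parameters used per token. It assumes the reported counts are exact round numbers. The function name `active_fraction` is hypothetical, and the calculation says nothing about how the model's expert routing actually works.

```python
# Parameter counts as reported in the article (assumed to be round figures).
ACTIVE_PARAMS = 35e9   # parameters active per token
TOTAL_PARAMS = 1e12    # total parameters across all experts


def active_fraction(active: float, total: float) -> float:
    """Return the share of a sparse MoE model's parameters used per token."""
    return active / total


fraction = active_fraction(ACTIVE_PARAMS, TOTAL_PARAMS)
ratio = TOTAL_PARAMS / ACTIVE_PARAMS

print(f"Active share per token: {fraction:.1%}")
print(f"Total-to-active ratio: {ratio:.1f}x")
```

Under these assumptions, roughly 3.5% of the model's parameters are active for any given token, so the total parameter count is nearly 29 times the active count.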