llama.cpp - Qwen3.6/3.5-MTP - Share your benchmarks t/s

r/LocalLLaMA
Generative AI Open Source AI

I think the dust has settled(95+%) for Qwen3.6/3.5-MTP. After the initial PR, so much optimizations & fixes. Even sometime ago today, there's a MTP related PR got merged & released( b9495 ). So try this latest version & share your benchmarks t/s*. Great work by u/am17an & other folks. * - Please share all stuff so it would be useful for others too. Also without particular missing details, benchmarks becomes inaccurate. Also I/We would like to have most optimized full command to get best t/s.