gemma-4-12b-it vs Qwen3.5-9B on shared benchmarks: Qwen is the overall winner, beating Gemma on 5/8 benchmarks despite a smaller footprint

r/LocalLLaMA

I don't really understand the Gemma hype. Qwen outperforms Gemma GB for GB, and its KV cache is lighter. Sure, gemma-4-12b-it might be a slightly better coder than Qwen3.5-9B, but you could also just use omnicoder-9b (a Qwen3.5-9B finetune for coding).

Note: Benchmark results come from the official Hugging Face model cards, formatted into a table with ChatGPT.

Submitted by /u/fulgencio_batista
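For anyone who wants to sanity-check the KV cache point on their own hardware, here is a rough sketch of the usual size estimate for standard full attention. The function name `kv_cache_bytes` and both example configs are hypothetical and are not the real Gemma or Qwen values; plug in the numbers from each model's `config.json`. Models that mix in sliding-window or linear-attention layers will need a per-layer adjustment, so treat this as an upper-bound style estimate rather than an exact figure.

```python
def kv_cache_bytes(num_layers, num_kv_heads, head_dim, context_len, bytes_per_value=2):
    """Estimate KV-cache size in bytes for one sequence with full attention.

    bytes_per_value=2 assumes an fp16/bf16 cache; use 1 for an 8-bit cache.
    """
    # Factor of 2: each layer stores one key tensor and one value tensor.
    return 2 * num_layers * num_kv_heads * head_dim * context_len * bytes_per_value


# Hypothetical configs for illustration only (NOT the real model values).
model_a = kv_cache_bytes(num_layers=48, num_kv_heads=8, head_dim=128, context_len=32768)
model_b = kv_cache_bytes(num_layers=32, num_kv_heads=4, head_dim=128, context_len=32768)

print(f"model_a: {model_a / 2**30:.2f} GiB")  # 6.00 GiB
print(f"model_b: {model_b / 2**30:.2f} GiB")  # 2.00 GiB
```

The takeaway is that cache size scales with layers, KV heads, head dimension, and context length, so a model with fewer KV heads or layers can be noticeably cheaper at long context even when parameter counts are close.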