Gemma 4 12b Q8 Heretic Oneshot Coding
r/LocalLLaMA
Open Source AI
I was pretty impressed with the Gemma 4 12b release today and saw that the heretic version had dropped. I was already getting refusals from the official Q8 model, so I decided to see how the heretic version would do at oneshotting a retro game. It did so with ease. The single prompt, start to finish, ate 45k tokens total.

Hardware Stack: Ryzen 9 9950X + AMD RX 6800 (16GB VRAM) via the Vulkan back-end, with 32GB of 6000 MT/s system RAM.

Model & Config: H-gemma-4-12B-heretic-Q8.gguf running with an 8-bit KV cache (--cache-type-k q8_0 --cache-type-v q8_0).
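For anyone wondering why the 8-bit KV cache matters on a 16GB card, here is a rough back-of-the-envelope sketch in Python. The layer count, KV head count, and head dimension are hypothetical placeholders, not the real numbers for this model, and the formula assumes every layer caches the full context (models with sliding-window layers would come in lower). The only hard fact used is that llama.cpp's q8_0 format packs 32 int8 values plus one fp16 scale into 34 bytes.

```python
# Bytes per cached element for two llama.cpp cache types.
# q8_0 stores blocks of 32 int8 values plus one fp16 scale: 34 bytes / 32 elements.
BYTES_PER_ELEMENT = {"f16": 2.0, "q8_0": 34 / 32}


def kv_cache_bytes(n_layers, n_kv_heads, head_dim, n_ctx, cache_type):
    """Estimate K+V cache size, assuming every layer caches the full context."""
    elements_per_token = 2 * n_layers * n_kv_heads * head_dim  # 2 = one K and one V
    return elements_per_token * n_ctx * BYTES_PER_ELEMENT[cache_type]


# Hypothetical architecture values for illustration only, NOT the real model's.
HYPOTHETICAL = dict(n_layers=48, n_kv_heads=8, head_dim=128)
N_CTX = 45_000  # roughly the token count reported for the single prompt

f16 = kv_cache_bytes(**HYPOTHETICAL, n_ctx=N_CTX, cache_type="f16")
q8 = kv_cache_bytes(**HYPOTHETICAL, n_ctx=N_CTX, cache_type="q8_0")

print(f"f16 cache:  {f16 / 2**30:.2f} GiB")
print(f"q8_0 cache: {q8 / 2**30:.2f} GiB ({q8 / f16:.1%} of f16)")
```

Whatever the real architecture numbers are, the ratio between the two cache types stays the same, so q8_0 roughly halves the cache footprint compared to f16. Swap in the values from the GGUF metadata to get an actual size.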