club-rdna16: practical 16GB AMD/Radeon local LLM testing repo

r/LocalLLaMA
Generative AI AI Hardware Open Source AI

Following on from club-5060ti, I’ve been doing some testing with my desktop AMD GPU and wanted to make a similar repo for 16GB Radeon cards. Repo: Pages/results: The first test machine is an RX 6900 XT 16GB running llama.cpp with ROCm/HIP. I’ve mainly been testing Qwen3.6 27B and Qwen3.6 35B-A3B using the Unsloth MTP GGUFs, currently using the UD-IQ3_XXS model quant with q8 KV cache. The repo is meant to be practical rather than a synthetic leaderboard.