Can't believe I got it working! Dual GPU - 48gb VRAM llama-cpp server - R7900 + 7800XT

r/LocalLLaMA
Generative AI AI Hardware Open Source AI

Setup: Kubuntu 24.04 - AMD cards - R9700 AI PRO and 7800xt (32gb + 16gb) - llama-cpp server - stack setup in docker - vulkan image I tried with ROCM but it wouldn't play nice with RDNA4 + RDNA3 mix. Vulkan seems to work. I tested a quick prompt, hopefully it's stable because if so, this gives me 48gb of VRAM to play with. Had to buy a new powersupply, but for $300 and to be able to leverage my older 7800xt - well worth it, I think. submitted by /u/Jorlen [link] [comments]