Best coding model on RTX 3060

r/LocalLLaMA

What's the best coding model that can fit on an RTX 3060 (12GB)? Has anyone been able to do something useful with it? I'm also wondering about the best setup (vLLM? llama.cpp?) and quantization. Thanks a lot, this community is great.

submitted by /u/solimaotheelephant3