GH200 NVL2 or 8x RTX 6000 Blackwell for running Kimi K2.6 / DeepSeek V4 locally? (5 devs, agentic coding)
r/LocalLLaMA
•
Generative AI
Open Source AI
Trying to figure out the right box for my team and wanted to see if anyone had any clue which would be a better fit or if it is not worth our time in our budget. Situation: 5 of us doing agentic coding (lots of long context getting re-sent every turn, parallel tool calls, etc.) and we want to self-host the latest open MoE models - Kimi K2.6 and DeepSeek V4 class. My boss likes the idea of having it in house so no point in just saying pay the API (I did pitch that) Budget is around $100k - $150k.