Cannot get NCCL test to run in docker with 2 x 6000 Pro connected x8 to AM4 CPU

r/LocalLLaMA
AI Hardware AI Tools

Nvidia-smi topo -m is showing the both GPU as PHB (i.e. via CPU) connected as expected but I cannot get NCCL all_reduce_perf to run at all, it always hangs after starting up. It seems that vllm won't work with TP=2 until I can fix this. Is there any reason why this setup would not work (it's X570 based)? TIA submitted by /u/NaiRogers [link] [comments]