DGX Spark agentic usage numbers

r/LocalLLaMA
Generative AI

What I need it to do: Be able to openclaw-type agent which is used by multiple people. What I tried: So I read in the internet about the atlas thing. I tried it, unfortunately it didn't fly for me. I tested everything on curl with long context prompt and with calls from openclaw as well. Problems: Tools cals are broken, Qwen3-coder doesn't seem to work inside atlas, TPS on long context was around 50, but on 4 concurrent it instead split to 4x16 tps Now Atlas is out of the picture, what actually is working: QuantTrio/Qwen3.6-35B-A3B-AWQ is working, but didn't yield satisfying result.