Has anyone successfully run it on two GB10 machines?

#3
by win10 - opened

Has anyone successfully run it on two GB10 machines?

Hi, that’s exactly my setup to run it

Any particular questions?

There’s a bug in vLLM about expert parallelism with TP>1 - so just drop this flag and it should be fine

I think the output speed is very fast, what do you think?

Hi, that’s exactly my setup to run it

Any particular questions?

There’s a bug in vLLM about expert parallelism with TP>1 - so just drop this flag and it should be fine

Yes absolutely!
I was very surprised by the speed.

It is very usable in terms of quality and performance and ATM my daily workhorse.

Need to try DFlash with Qwen3.5 27b though but MiniMax M2.7 is the best I’ve tried so far on 2xGB10

Sign up or log in to comment