Working VLLM Command
#10 opened 25 days ago
by
nmitchko
Unable to use full 192k context in SGLang with MiniMax-M2.7-NVFP4 (runtime capped at ~80,964 tokens)
3
#9 opened about 1 month ago
by
mtcl
"KLD reduced by ~10%."
#8 opened about 1 month ago
by
vgoklani
Next-level!
👍 2
#7 opened about 1 month ago
by
mayhem4markets
2 DGX Spark cluster recipe
🤗 2
#6 opened about 2 months ago
by
susni
tokenizer component mismatch and w1_weight_scale_2 must match w3_weight_scale_2. Accuracy may be affected issue
1
#5 opened about 2 months ago
by
mtcl
Working configuration for Nvidia Blackwell
13
#4 opened about 2 months ago
by
luismiguelsaez
Calibration Dataset Mixture
1
#3 opened about 2 months ago
by
vgoklani
Thanks, thanks and more thanks. Many thanks.
15
#2 opened about 2 months ago
by
aaron-newsome
w1 not matching w3 weight scales
12
#1 opened about 2 months ago
by
dareposte