Shihan Qu
zenmagnets
AI & ML interests
None yet
Recent Activity
new activity about 17 hours ago
Qwen/Qwen3.5-397B-A17B:Qwen3.6 397b new activity 1 day ago
varjosoft/GLM-5.1-Open-TQ3:Pending GPU & vLLM validation new activity 3 days ago
MiniMaxAI/MiniMax-M2.7:No commercial use allowed in License?Organizations
None yet
Qwen3.6 397b
#75 opened about 17 hours ago
by
zenmagnets
Pending GPU & vLLM validation
3
#1 opened 6 days ago
by
nwzjk
No commercial use allowed in License?
😔👀 4
9
#6 opened 3 days ago
by
zenmagnets
license
👀👍 9
6
#5 opened 3 days ago
by
festr2
How to run on vLLM for 4xSM120
#1 opened about 1 month ago
by
zenmagnets
Here's the vLLM recipe I'm using with 2x RTX Pro 6000
👍 3
17
#1 opened 2 months ago
by
zenmagnets
Anyone get this working on 4x RTX 6000 Pro?
👀 2
5
#1 opened about 2 months ago
by
zenmagnets
Throughput NVFP4 on Dual 6000 Blackwells
#2 opened about 2 months ago
by
zenmagnets
Anyone try this on 4x RTX 6000 Pro yet?
52
#1 opened about 2 months ago
by
zenmagnets
I wish it would fit in 2x6000 PRO!
1
#2 opened about 2 months ago
by
mtcl
"w1_weight_scale_2 must match w3_weight_scale_2. Accuracy may be affected."
👍 1
21
#2 opened 2 months ago
by
zenmagnets
Wasn't able to recreate MMLU-Pro benchmarks
5
#5 opened 3 months ago
by
zenmagnets
Enormous KV-cache size?
👍➕ 6
23
#3 opened 3 months ago
by
nephepritou
Really appreciate that you ran performance comparison tests with BF16!
3
#2 opened 3 months ago
by
zenmagnets
Performance comps with BF16?
1
#3 opened 3 months ago
by
zenmagnets
Any plans for a 6bit or 8bit version?
1
#3 opened 3 months ago
by
zenmagnets
If 8bit, why shaped like 16 bit
2
#2 opened 3 months ago
by
zenmagnets
6 months since intro of NVFP4, and it's basically still a myth
1
#4 opened 4 months ago
by
zenmagnets
Works with vllm? Any recommendations or howtos?
7
#1 opened 6 months ago
by
DrRos
What Token Generation are you guys getting with CPU only?
#3 opened 5 months ago
by
zenmagnets