TRT-LLM - Convert for trtllm-build
1
#5 opened 3 days ago
by
marcelo-soria-shrinkit
6 months since intro of NVFP4, and it's basically still a myth
1
#4 opened 5 months ago
by
zenmagnets
NVFP4 Quantization for Qwen3-coder-30b-a3b-instruct
#3 opened 5 months ago
by
eryuanren
Is this model a MoE model from Qwen3-30B-A3B?
1
#2 opened 7 months ago
by
L-Hongbin
Serve with vLLM
🔥 1
4
#1 opened 8 months ago
by
faheemraza1