Scott Glover
scottgl
AI & ML interests
None yet
Recent Activity
new activity about 8 hours ago
mudler/Step-3.7-Flash-APEX-GGUF: MTP weights
liked a model about 8 hours ago
mudler/Step-3.7-Flash-APEX-GGUF
liked a model about 8 hours ago
mudler/Qwen3.6-35B-A3B-Claude-4.7-Opus-Reasoning-Distilled-APEX-MTP-GGUF
Organizations
None yet
MTP weights
#1 opened about 8 hours ago
by
scottgl
MTP weights
#2 opened about 9 hours ago
by
scottgl
Step-3.7-Flash quant support with the MTP GGUF models
1
#2 opened 21 days ago
by
scottgl
Why Your NVFP4 Model Is Slower Than FP8 on the GB10 (NVIDIA Spark) — And How to Fix It
🤯👍 5
6
#5 opened 4 months ago
by
scottgl
Quantization Code
1
#1 opened 2 months ago
by
vgoklani
Issues for GB10 users
2
#1 opened 2 months ago
by
scottgl
NVFP4 quantization of m51Lab-MiniMax-M2.7-REAP-139B-A10B
3
#1 opened 2 months ago
by
scottgl
Minimax 2.7
5
#1 opened 2 months ago
by
dustinogle1
Excellent model on DGX Spark
👍 1
4
#1 opened 4 months ago
by
bkmtech
Recommendations for running on Strix Halo.
2
#2 opened 4 months ago
by
scottgl
MTP model weights
#3 opened 4 months ago
by
scottgl
MTP results with vLLM inside
7
#10 opened 4 months ago
by
unoid
[Bug] Model outputs only "!" — quantization_config.ignore missing fused projection names (in_proj_ba / in_proj_qkvz) for linear attention layers
4
#4 opened 4 months ago
by
scottgl
MTP Added - Re-download
🔥🚀 2
7
#7 opened 4 months ago
by
Sehyo
Qwen3.5 122B on Strix Halo
5
#1 opened 4 months ago
by
scottgl
MTP support in model
5
#5 opened 4 months ago
by
scottgl
Could you create an NVFP4 version?
#2 opened 4 months ago
by
scottgl