Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
254.6
TFLOPS
21
1
GadflyII
GadflyII
Follow
DimensionSTP's profile picture
AlexGS74's profile picture
limegreenpeper1's profile picture
29 followers
·
1 following
AI & ML interests
None yet
Recent Activity
liked
a model
7 days ago
ibm-granite/granite-4.1-8b-fp8
new
activity
21 days ago
GadflyII/GLM-4.6V-NVFP4:
Well done nvfp4 quant
new
activity
21 days ago
GadflyII/Qwen3-Coder-Next-NVFP4:
Why Your NVFP4 Model Is Slower Than FP8 on the GB10 (NVIDIA Spark) — And How to Fix It
View all activity
Organizations
GadflyII
's activity
All
Models
Datasets
Spaces
Buckets
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
liked
a model
7 days ago
ibm-granite/granite-4.1-8b-fp8
Text Generation
•
9B
•
Updated
7 days ago
•
7.46k
•
12
New activity in
GadflyII/GLM-4.6V-NVFP4
21 days ago
Well done nvfp4 quant
3
#1 opened 3 months ago by
josephbreda
New activity in
GadflyII/Qwen3-Coder-Next-NVFP4
21 days ago
Why Your NVFP4 Model Is Slower Than FP8 on the GB10 (NVIDIA Spark) — And How to Fix It
🤯
👍
5
6
#5 opened 2 months ago by
scottgl
New activity in
GadflyII/GLM-4.7-Flash-MTP-NVFP4
about 2 months ago
SGLang and MTP
1
#2 opened 2 months ago by
Michalea
New activity in
GadflyII/Qwen3-Coder-Next-NVFP4
2 months ago
Model requests?
12
#4 opened 3 months ago by
pathosethoslogos
New activity in
GadflyII/GLM-4.6V-NVFP4
2 months ago
Fails on a single DGX spark with errors below
1
#2 opened 2 months ago by
Adrian1234
New activity in
GadflyII/GLM-4.7-Flash-MXFP4
3 months ago
Update MXFP4 format to compressed-tensors
1
#3 opened 3 months ago by
mgoin
New activity in
lukealonso/MiniMax-M2.5-NVFP4
3 months ago
Here's the vLLM recipe I'm using with 2x RTX Pro 6000
👍
3
17
#1 opened 3 months ago by
zenmagnets
New activity in
GadflyII/Qwen3-Coder-Next-NVFP4
3 months ago
MMLU PRO Benchmark
3
#3 opened 3 months ago by
sevapru
vLLM 0.16?
1
#2 opened 3 months ago by
MMaxHugg
Memory
1
#1 opened 3 months ago by
struxx
New activity in
GadflyII/GLM-4.7-Flash-NVFP4
3 months ago
confused response
7
#8 opened 3 months ago by
jiangyizhi
updated
a model
3 months ago
GadflyII/Qwen3-Coder-Next-NVFP4
Text Generation
•
Updated
Feb 4
•
14k
•
43
published
a model
3 months ago
GadflyII/Qwen3-Coder-Next-NVFP4
Text Generation
•
Updated
Feb 4
•
14k
•
43
New activity in
GadflyII/GLM-4.7-Flash-NVFP4
3 months ago
MTP quality, 47 layer
3
#7 opened 3 months ago by
Michalea
updated
a model
3 months ago
GadflyII/GLM-4.7-Flash-MTP-NVFP4
Text Generation
•
19B
•
Updated
Feb 2
•
1.54k
•
5
New activity in
GadflyII/GLM-4.7-Flash-MTP-NVFP4
3 months ago
Upload folder using huggingface_hub
#1 opened 3 months ago by
GadflyII
published
a model
3 months ago
GadflyII/GLM-4.7-Flash-MTP-NVFP4
Text Generation
•
19B
•
Updated
Feb 2
•
1.54k
•
5
New activity in
GadflyII/GLM-4.7-Flash-NVFP4
3 months ago
Can't deploy by vllm 0.14.1 + transformers
8
#6 opened 3 months ago by
Butterfly-314
New activity in
GadflyII/GLM-4.7-Flash-MXFP4
3 months ago
can not run
4
#1 opened 3 months ago by
aliez-ren
Load more