will
willfalco
0 followers
·
6 following
AI & ML interests
None yet
Organizations
None yet
willfalco's activity
New activity in
XiaomiMiMo/MiMo-V2-Flash
about 15 hours ago
Great Model! - sglang mtp support for triton backend
👍
3
3
#19 opened 3 days ago by
chriswritescode
New activity in
QuantTrio/DeepSeek-V3.1-AWQ-Lite
10 days ago
[request] DeepSeek-V3.1-Terminus
4
#3 opened 13 days ago by
willfalco
New activity in
lukealonso/MiniMax-M2-NVFP4
10 days ago
you know which nightly it worked with? because it does not with current one
31
#1 opened about 1 month ago by
willfalco
New activity in
QuantTrio/DeepSeek-V3.1-AWQ-Lite
12 days ago
random artifacts on larger outputs
2
#4 opened 13 days ago by
willfalco
liked
2 models
13 days ago
cyankiwi/Devstral-2-123B-Instruct-2512-AWQ-4bit
22B
•
Updated
13 days ago
•
3.39k
•
15
cerebras/DeepSeek-V3.2-REAP-345B-A37B
Text Generation
•
345B
•
Updated
16 days ago
•
1.75k
•
27
New activity in
Firworks/INTELLECT-3-nvfp4
16 days ago
is NVFP4 supported on sm120 (blackwell rtx pro 6000, rtx 5090 etc)?
10
#4 opened 24 days ago by
Fernanda24
New activity in
tencent/DeepSeek-V3.1-Terminus-W4AFP8
19 days ago
4 x RTX PRO 6000
👍
1
2
#1 opened 23 days ago by
willfalco
New activity in
eousphoros/DeepSeek-V3.2-NVFP4
19 days ago
Is it possible to make smaller NVFP4 quant at 340-360GB to fit in 4x96gb?
👍
1
68
#1 opened 22 days ago by
Fernanda24
New activity in
Intel/DeepSeek-V3.1-Terminus-int4-mixed-AutoRound
19 days ago
Question will it work in vllm or sglang with rtx 6000 blackwells? cuda arch sm120
6
#1 opened 2 months ago by
Fernanda24
liked
a model
19 days ago
Intel/DeepSeek-V3.1-Terminus-int4-mixed-AutoRound
Text Generation
•
2B
•
Updated
Sep 23
•
254
•
4
New activity in
QuantTrio/DeepSeek-V3.1-AWQ-Lite
19 days ago
ooof this fits in 4x96gb can we get this for the new 3.2 Speciale as well please :)
16
#2 opened 23 days ago by
Fernanda24
New activity in
QuantTrio/DeepSeek-V3.2-AWQ
20 days ago
Aww Man!
20
#1 opened 22 days ago by
mtcl
liked
a model
22 days ago
Kwaipilot/KAT-Dev-FP8
Text Generation
•
33B
•
Updated
Oct 10
•
11
•
4
New activity in
miromind-ai/MiroThinker-v1.0-72B
25 days ago
slow by design?
1
#1 opened 25 days ago by
willfalco
liked
a model
25 days ago
Firworks/MiroThinker-v1.0-72B-nvfp4
42B
•
Updated
Nov 19
•
10
•
1
liked
a model
27 days ago
PrimeIntellect/INTELLECT-3-FP8
Text Generation
•
107B
•
Updated
28 days ago
•
2.36k
•
18
liked
a model
about 1 month ago
QuantTrio/GLM-4.6-GPTQ-Int4-Int8Mix
Text Generation
•
69B
•
Updated
Oct 3
•
1.06k
•
4
New activity in
QuantTrio/DeepSeek-V3.2-Exp-AWQ-Lite
about 1 month ago
anyone ran this on blackwell?
🔥
1
#2 opened about 1 month ago by
willfalco
liked
a model
about 1 month ago
QuantTrio/DeepSeek-V3.2-Exp-AWQ-Lite
Text Generation
•
685B
•
Updated
Oct 1
•
99
•
4