563.0 TFLOPS
Gleb Kurchanov
nephepritou
2 followers · 9 following
AI & ML interests
None yet
Organizations
None yet
nephepritou's activity
New activity in Lasimeri/MiniMax-M2.7-int4-AutoRound (29 days ago)
OutOfMemory during weights loading (vLLM)
#1 opened 29 days ago by nephepritou
liked a model (about 2 months ago)
olka-fi/Qwen3.5-122B-A10B-MXFP4
Text Generation • 71B • Updated Feb 25 • 490 • 10
New activity in cyankiwi/Qwen3.5-122B-A10B-AWQ-4bit (2 months ago)
Updated weights • 2
#5 opened 2 months ago by nephepritou
liked a model (3 months ago)
fishaudio/s2-pro
Text-to-Speech • 5B • Updated Mar 11 • 158k • 997
New activity in Intel/Qwen3.5-122B-A10B-int4-AutoRound (3 months ago)
Does the A100 work? • 12
#1 opened 3 months ago by xz123321
New activity in Sehyo/Qwen3.5-122B-A10B-NVFP4 (3 months ago)
Quantization instruction • 5
#6 opened 3 months ago by nephepritou
liked a model (3 months ago)
Qwen/Qwen3.5-122B-A10B-FP8
Image-Text-to-Text • 125B • Updated Apr 24 • 938k • 100
New activity in unsloth/Qwen3-Coder-Next-FP8-Dynamic (4 months ago)
Inconsistent output (resolved) • 5
#2 opened 4 months ago by nephepritou
liked 2 models (4 months ago)
Qwen/Qwen3-Coder-Next
Text Generation • 80B • Updated Feb 3 • 880k • 1.41k
Qwen/Qwen3-Coder-Next-FP8
Text Generation • 80B • Updated Feb 3 • 721k • 149
New activity in Qwen/Qwen3-Coder-Next (4 months ago)
Very specific json formatting issue in tool calls • ➕ 1 • 4
#14 opened 4 months ago by deleted
liked a model (4 months ago)
meituan-longcat/LongCat-Flash-Lite
Text Generation • 69B • Updated Feb 6 • 3.26k • 187
New activity in zai-org/GLM-4.7-Flash (4 months ago)
Thank you Z.AI, I love this model! ❤ • 👀 ❤️ 8 • 5
#43 opened 4 months ago by MrDevolver
Model breaks apart when used with different languages • 2
#38 opened 4 months ago by nephepritou
Enormous KV-cache size? • 👍 ➕ 6 • 23
#3 opened 4 months ago by nephepritou
Why does the KV cache occupy so much GPU memory? • 13
#21 opened 4 months ago by yyg201708
New activity in cyankiwi/GLM-4.5-Air-AWQ-4bit (6 months ago)
Running on 4 GPUs with TP=4 • 3
#11 opened 6 months ago by nephepritou
New activity in nvidia/NVIDIA-Nemotron-3-Nano-30B-A3B-BF16 (6 months ago)
Tool calling with reasoning parsing broken • 11
#3 opened 6 months ago by nephepritou
New activity in cyankiwi/GLM-4.6V-AWQ-4bit (6 months ago)
Question about group size • 2
#1 opened 6 months ago by nephepritou
New activity in cyankiwi/Qwen3-Next-80B-A3B-Thinking-AWQ-4bit (6 months ago)
Model not loading • 7
#8 opened 7 months ago by nephepritou