Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
17
6
zhangchenchen
zhnagchenchne
Follow
adom593's profile picture
21world's profile picture
2 followers
·
4 following
AI & ML interests
None yet
Recent Activity
liked
a model
10 days ago
openai/circuit-sparsity
new
activity
about 2 months ago
Qwen/Qwen3-Omni-30B-A3B-Thinking:
vLLM serve for Qwen3-Omni currently only supports the thinker model.
new
activity
3 months ago
ByteDance-Seed/Seed-OSS-36B-Instruct:
Hiding Thinking Process
View all activity
Organizations
zhnagchenchne
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
liked
a model
10 days ago
openai/circuit-sparsity
Text Generation
•
0.4B
•
Updated
14 days ago
•
1.99k
•
186
New activity in
Qwen/Qwen3-Omni-30B-A3B-Thinking
about 2 months ago
vLLM serve for Qwen3-Omni currently only supports the thinker model.
#8 opened about 2 months ago by
zhnagchenchne
New activity in
ByteDance-Seed/Seed-OSS-36B-Instruct
3 months ago
Hiding Thinking Process
2
#25 opened 4 months ago by
Xzendor7
Where is the inference/vllm_tool_call.py inference/vllm_chat.py generate.py ?
2
#27 opened 3 months ago by
zhnagchenchne
New activity in
moonshotai/Kimi-K2-Instruct
5 months ago
What actually is the EOS token for this model?
4
#31 opened 5 months ago by
jukofyork
New activity in
RedHatAI/Kimi-K2-Instruct-quantized.w4a16
5 months ago
Keep outputting empty content without stopping
2
#3 opened 5 months ago by
zhnagchenchne
New activity in
adamo1139/DeepSeek-R1-0528-AWQ
5 months ago
About AWQ
#3 opened 5 months ago by
zhnagchenchne
New activity in
moonshotai/Kimi-K2-Instruct
5 months ago
Run 1T-param on A100/H100(80G)x8 using FP4
🚀
🔥
5
7
#9 opened 6 months ago by
ghostplant
New activity in
Qwen/Qwen3-Embedding-0.6B
6 months ago
onnx-community/Qwen3-Embedding-0.6B-ONNX
#25 opened 6 months ago by
zhnagchenchne
New activity in
QuixiAI/DeepSeek-R1-0528-AWQ
7 months ago
The result is problematic.
1
#3 opened 7 months ago by
zhnagchenchne
tokenizer_config.json
#2 opened 7 months ago by
zhnagchenchne
liked
a model
7 months ago
QuixiAI/DeepSeek-R1-0528-AWQ
Text Generation
•
671B
•
Updated
Jun 1
•
58
•
19
liked
a model
8 months ago
deepseek-ai/DeepSeek-Prover-V2-671B
Text Generation
•
685B
•
Updated
Apr 30
•
632
•
•
815
New activity in
deepseek-ai/DeepSeek-Prover-V2-671B
8 months ago
DeepSeek-Prover-V1 的升级版?
1
#13 opened 8 months ago by
zhnagchenchne
liked
2 models
9 months ago
deepseek-ai/DeepSeek-V3-0324
Text Generation
•
685B
•
Updated
Mar 27
•
201k
•
•
3.08k
QuixiAI/DeepSeek-V3-0324-AWQ
Text Generation
•
671B
•
Updated
Mar 29
•
146
•
23
updated
a model
9 months ago
YiXin-AILab/YiXin-Distill-Qwen-72B-AWQ
73B
•
Updated
Mar 25
•
11
•
1
published
a model
9 months ago
YiXin-AILab/YiXin-Distill-Qwen-72B-AWQ
73B
•
Updated
Mar 25
•
11
•
1
New activity in
YiXin-AILab/YiXin-Distill-Qwen-72B
9 months ago
Calibration dataset
1
#4 opened 9 months ago by
AlphaGaO
Please increase code reasoning skills
2
#5 opened 9 months ago by
xldistance
Load more