Hugging Face
Kunqing Wang
WKQ9411
1 follower · 3 following
WKQ9411
AI & ML interests
None yet
Recent Activity
new
activity
7 days ago
deepseek-ai/DeepSeek-V4-Pro:
Where is HCA implemented?
new
activity
8 days ago
deepseek-ai/DeepSeek-V4-Pro:
A typo in Partial Rotary Positional Embedding?
new
activity
2 months ago
opencsg/Fineweb-Edu-Chinese-V2.2:
Some problems with synthetic data quality
Organizations
None yet
WKQ9411's activity
New activity in
deepseek-ai/DeepSeek-V4-Pro
7 days ago
Where is HCA implemented?
2
#161 opened 7 days ago by
lsh-algorithm
New activity in
deepseek-ai/DeepSeek-V4-Pro
8 days ago
A typo in Partial Rotary Positional Embedding?
👀
4
#159 opened 8 days ago by
WKQ9411
New activity in
opencsg/Fineweb-Edu-Chinese-V2.2
2 months ago
Some problems with synthetic data quality
1
#3 opened 2 months ago by
WKQ9411
New activity in
opencsg/Fineweb-Edu-Chinese-V2.2
3 months ago
This open-source release is so timely!
🤝
1
#2 opened 3 months ago by
WKQ9411
liked
a dataset
3 months ago
opencsg/Fineweb-Edu-Chinese-V2.2
•
Updated
Feb 2
•
4.56k
•
76
updated
a model
3 months ago
WKQ9411/Mini-Qwen3-Next-160M-A100M-SFT
Text Generation
•
0.2B
•
Updated
Feb 1
•
23
•
1
published
a model
3 months ago
WKQ9411/Mini-Qwen3-Next-160M-A100M-SFT
Text Generation
•
0.2B
•
Updated
Feb 1
•
23
•
1
updated
a model
3 months ago
WKQ9411/Mini-Qwen3-Next-160M-A100M-Base
Text Generation
•
0.2B
•
Updated
Feb 1
•
17
published
a model
3 months ago
WKQ9411/Mini-Qwen3-Next-160M-A100M-Base
Text Generation
•
0.2B
•
Updated
Feb 1
•
17
updated
4 models
4 months ago
WKQ9411/Mini-DeepSeekV3-160M-A100M-Base
Text Generation
•
0.2B
•
Updated
Dec 28, 2025
•
3
WKQ9411/Mini-DeepSeekV3-160M-A100M-SFT
Text Generation
•
0.2B
•
Updated
Dec 28, 2025
•
18
•
1
WKQ9411/Mini-Llama3-100M-Base
Text Generation
•
0.1B
•
Updated
Dec 28, 2025
•
12
WKQ9411/Mini-Llama3-100M-SFT
Text Generation
•
0.1B
•
Updated
Dec 28, 2025
•
5
published
4 models
4 months ago
WKQ9411/Mini-DeepSeekV3-160M-A100M-SFT
Text Generation
•
0.2B
•
Updated
Dec 28, 2025
•
18
•
1
WKQ9411/Mini-DeepSeekV3-160M-A100M-Base
Text Generation
•
0.2B
•
Updated
Dec 28, 2025
•
3
WKQ9411/Mini-Llama3-100M-SFT
Text Generation
•
0.1B
•
Updated
Dec 28, 2025
•
5
WKQ9411/Mini-Llama3-100M-Base
Text Generation
•
0.1B
•
Updated
Dec 28, 2025
•
12
New activity in
meta-llama/Llama-4-Scout-17B-16E-Original
about 1 year ago
Open Source License
2
#2 opened about 1 year ago by
harisnaeem