Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
2
3
luchangli03
luchangli03
Follow
Steve-Guo's profile picture
1 follower
·
4 following
AI & ML interests
None yet
Organizations
luchangli03
's activity
All
Models
Datasets
Spaces
Buckets
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
New activity in
AQ-MedAI/Kimi-K25-eagle3
2 months ago
Can you reduce the kv head num of this model? "num_key_value_heads": 64, which requies a lots of kv cache
2
#1 opened 2 months ago by
luchangli03
New activity in
jerryzh168/Kimi-K2-Thinking-FP8
3 months ago
Can you provide the code that convert the int4 weight to fp8? thanks
#2 opened 3 months ago by
luchangli03
New activity in
chutesai/DeepSeek-V3.1-Terminus-NextN
5 months ago
what's the difference between this nextn and self contained MTP model in DeepSeek-V3.1-Terminus?
#1 opened 5 months ago by
luchangli03