Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
2
3
luchangli03
luchangli03
Follow
0 followers
·
4 following
AI & ML interests
None yet
Recent Activity
liked
a model
6 days ago
lightseekorg/kimi-k2.5-eagle3
new
activity
10 days ago
AQ-MedAI/Kimi-K25-eagle3:
Can you reduce the kv head num of this model? "num_key_value_heads": 64, which requies a lots of kv cache
liked
a model
11 days ago
AQ-MedAI/Kimi-K25-eagle3
View all activity
Organizations
luchangli03
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
New activity in
AQ-MedAI/Kimi-K25-eagle3
10 days ago
Can you reduce the kv head num of this model? "num_key_value_heads": 64, which requies a lots of kv cache
2
#1 opened 10 days ago by
luchangli03
New activity in
jerryzh168/Kimi-K2-Thinking-FP8
about 1 month ago
Can you provide the code that convert the int4 weight to fp8? thanks
#2 opened about 1 month ago by
luchangli03
New activity in
chutesai/DeepSeek-V3.1-Terminus-NextN
4 months ago
what's the difference between this nextn and self contained MTP model in DeepSeek-V3.1-Terminus?
#1 opened 4 months ago by
luchangli03