cjx
sirusGray
0 followers · 2 following
AI & ML interests
None yet
Organizations
None yet
sirusGray's activity
New activity in XiaomiMiMo/MiMo-V2-Flash (5 months ago)
[Bug] RoPE initialization for SWA layers modifies shared config object, causing incorrect rope_theta for non-SWA layers
#32 opened 5 months ago by sirusGray
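The class of bug named in the title of #32 can be illustrated with a minimal sketch. This is not the MiMo-V2-Flash modeling code: `Config`, `swa_rope_theta`, `build_rope_buggy`, and `build_rope_fixed` are hypothetical names, and the theta values are placeholders chosen only to make the effect visible. The sketch assumes that layers are built in order from one shared config object, and that a sliding-window-attention (SWA) layer overwrites `rope_theta` on that object instead of on a private copy.

```python
import copy
from dataclasses import dataclass


@dataclass
class Config:
    # Hypothetical fields and placeholder values, for illustration only.
    rope_theta: float = 5_000_000.0   # base used by full-attention layers
    swa_rope_theta: float = 10_000.0  # base used by SWA layers


def build_rope_buggy(config: Config, is_swa: bool) -> float:
    """Return the RoPE base for one layer, mutating the shared config."""
    if is_swa:
        # Bug: this write is visible to every layer built afterwards.
        config.rope_theta = config.swa_rope_theta
    return config.rope_theta


def build_rope_fixed(config: Config, is_swa: bool) -> float:
    """Return the RoPE base for one layer using a per-layer copy."""
    local = copy.copy(config)  # shallow copy is enough for scalar fields
    if is_swa:
        local.rope_theta = local.swa_rope_theta
    return local.rope_theta


# One SWA layer followed by one full-attention layer.
layer_is_swa = [True, False]

shared = Config()
buggy = [build_rope_buggy(shared, s) for s in layer_is_swa]

shared = Config()
fixed = [build_rope_fixed(shared, s) for s in layer_is_swa]

print(buggy)  # both layers end up with the SWA base
print(fixed)  # each layer keeps its own base
```

In the buggy variant the full-attention layer inherits the SWA base because the earlier write persists on the shared object; copying the config per layer (or passing the base as an explicit argument) avoids the aliasing.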
Will the MTP model be supported in the modeling code?
#27 opened 5 months ago by sirusGray