Hugging Face
cjx
sirusGray
0 followers · 2 following
AI & ML interests
None yet
Recent Activity
new
activity
about 2 months ago
XiaomiMiMo/MiMo-V2-Flash:
[Bug] RoPE initialization for SWA layers modifies shared config object, causing incorrect rope_theta for non-SWA layers
new
activity
about 2 months ago
XiaomiMiMo/MiMo-V2-Flash:
Will supporting mtp model in modeling?
liked
a model
about 2 months ago
XiaomiMiMo/MiMo-V2-Flash
Organizations
None yet
sirusGray's activity
liked
a model
about 2 months ago
XiaomiMiMo/MiMo-V2-Flash
Text Generation • 310B • Updated 26 days ago • 158k • 672