CobraMamba
13 followers · 2 following
https://github.com/chi2liu
633WHU
AI & ML interests
None yet
Recent Activity
published a model 15 days ago
CobraMamba/aaaa
published a model 5 months ago
CobraMamba/DeepSeek-R1-Distill-Qwen-1.5B-GSPO3
updated a model 5 months ago
CobraMamba/DeepSeek-R1-Distill-Qwen-1.5B-GSPO2
Organizations
None yet
CobraMamba's activity
published a model 15 days ago
CobraMamba/aaaa
Updated 15 days ago
published a model 5 months ago
CobraMamba/DeepSeek-R1-Distill-Qwen-1.5B-GSPO3
Updated Aug 11
updated a model 5 months ago
CobraMamba/DeepSeek-R1-Distill-Qwen-1.5B-GSPO2
Text Generation • 2B • Updated Aug 10 • 9
published 2 models 5 months ago
CobraMamba/DeepSeek-R1-Distill-Qwen-1.5B-GSPO2
Text Generation • 2B • Updated Aug 10 • 9
CobraMamba/DeepSeek-R1-Distill-Qwen-1.5B-GSPO
Updated Aug 10
updated a model 8 months ago
CobraMamba/Qwen3-30B-A3B-AWQ-4Bit
Text Generation • 31B • Updated May 9 • 170
updated a collection 8 months ago
Qwen-AWQ
Collection • 4 items • Updated May 9
published a model 8 months ago
CobraMamba/Qwen3-30B-A3B-AWQ-4Bit
Text Generation • 31B • Updated May 9 • 170
New activity in ISTA-DASLab/DeepSeek-R1-GPTQ-4b-128g-experts 8 months ago
How to Only compress non-shared experts within transformer blocks?
1
#1 opened 8 months ago by CobraMamba
liked a model 8 months ago
ISTA-DASLab/DeepSeek-R1-GPTQ-4b-128g-experts
Text Generation • 104B • Updated Apr 8 • 38 • 4
updated a collection 8 months ago
Qwen-AWQ
Collection • 4 items • Updated May 9
updated a model 8 months ago
CobraMamba/Qwen3-32B-AWQ
Text Generation • 33B • Updated Apr 30 • 17
published a model 8 months ago
CobraMamba/Qwen3-32B-AWQ
Text Generation • 33B • Updated Apr 30 • 17
updated a collection 8 months ago
Qwen-AWQ
Collection • 4 items • Updated May 9
updated a model 8 months ago
CobraMamba/Qwen3-8B-AWQ
Text Generation • 8B • Updated Apr 30 • 16
published a model 8 months ago
CobraMamba/Qwen3-8B-AWQ
Text Generation • 8B • Updated Apr 30 • 16
New activity in CobraMamba/mamba-gpt-7b 8 months ago
Adding `safetensors` variant of this model
#2 opened about 1 year ago by SFconvertbot
New activity in CobraMamba/mamba-gpt-7b-v2 8 months ago
Adding `safetensors` variant of this model
#2 opened about 1 year ago by SFconvertbot
New activity in CobraMamba/mamba-gpt-7b-v1 8 months ago
Base Model
1
#2 opened about 1 year ago by Shameless111
Adding `safetensors` variant of this model
#3 opened 12 months ago by SFconvertbot