Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
273.2
TFLOPS
3
1
Dan Voyce
the1dv
Follow
AI & ML interests
None yet
Organizations
None yet
the1dv
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
New activity in
stelterlab/DeepSeek-R1-0528-Qwen3-8B-AWQ
5 months ago
Rope Scaling pre-applied?
6
#1 opened 6 months ago by
the1dv
New activity in
mlx-community/DeepSeek-R1-Qwen3-0528-8B-4bit-AWQ
6 months ago
ValueError: There is no module or parameter named 'lm_head.biases' in Qwen3ForCausalLM
2
#1 opened 6 months ago by
the1dv
ValueError: There is no module or parameter named 'lm_head.biases' in Qwen3ForCausalLM
2
#1 opened 6 months ago by
the1dv
New activity in
stelterlab/DeepSeek-R1-0528-Qwen3-8B-AWQ
6 months ago
Rope Scaling pre-applied?
6
#1 opened 6 months ago by
the1dv
Load more