Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
AXERA-TECH
/
DeepSeek-R1-Distill-Qwen-1.5B
like
5
Follow
AXERA
116
Text Generation
Transformers
Chinese
English
Context
DeepSeek-R1-Distill-Qwen-1.5B
License:
mit
Model card
Files
Files and versions
xet
Community
Deploy
Use this model
b818976
DeepSeek-R1-Distill-Qwen-1.5B
10 MB
Ctrl+K
Ctrl+K
2 contributors
History:
2 commits
qqc1989
Upload 6 files
b818976
verified
over 1 year ago
deepseek-r1_tokenizer
Upload 6 files
over 1 year ago
.gitattributes
Safe
1.57 kB
Upload 6 files
over 1 year ago
README.md
Safe
33 Bytes
initial commit
over 1 year ago
deepseek-r1_tokenizer.py
Safe
4.27 kB
Upload 6 files
over 1 year ago
main_prefill
3 MB
xet
Upload 6 files
over 1 year ago
run_deepseek-r1_1.5B_ax630c.sh
Safe
512 Bytes
Upload 6 files
over 1 year ago
run_deepseek-r1_1.5B_ax650.sh
Safe
509 Bytes
Upload 6 files
over 1 year ago