Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

AXERA-TECH
/
DeepSeek-R1-Distill-Qwen-1.5B

Text Generation
Transformers
Chinese
English
Context
DeepSeek-R1-Distill-Qwen-1.5B
Model card Files Files and versions
xet
Community
DeepSeek-R1-Distill-Qwen-1.5B
10 MB
Ctrl+K
Ctrl+K
  • 2 contributors
History: 2 commits
qqc1989's picture
qqc1989
Upload 6 files
b818976 verified over 1 year ago
  • deepseek-r1_tokenizer
    Upload 6 files over 1 year ago
  • .gitattributes
    1.57 kB
    Upload 6 files over 1 year ago
  • README.md
    33 Bytes
    initial commit over 1 year ago
  • deepseek-r1_tokenizer.py
    4.27 kB
    Upload 6 files over 1 year ago
  • main_prefill
    3 MB
    xet
    Upload 6 files over 1 year ago
  • run_deepseek-r1_1.5B_ax630c.sh
    512 Bytes
    Upload 6 files over 1 year ago
  • run_deepseek-r1_1.5B_ax650.sh
    509 Bytes
    Upload 6 files over 1 year ago