Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
cyberagent
/
DeepSeek-R1-Distill-Qwen-14B-Japanese
like
95
Follow
CyberAgent
657
Text Generation
Safetensors
Japanese
qwen2
japanese
conversational
arxiv:
2501.12948
License:
mit
Model card
Files
Files and versions
xet
Community
Deploy
Use this model
main
DeepSeek-R1-Distill-Qwen-14B-Japanese
Commit History
Update README.md
cc1ebcc
verified
rishigami
commited on
Jan 27, 2025
Upload tokenizer
8658147
verified
rishigami
commited on
Jan 27, 2025
Upload Qwen2ForCausalLM
c473b37
verified
rishigami
commited on
Jan 27, 2025
initial commit
7457875
verified
rishigami
commited on
Jan 27, 2025