China models
updated
Text Generation
• Updated • 7.3k
• 32
internlm/internlm2-chat-1_8b
Text Generation
• 2B • Updated • 5.53k
• 36
Text Generation
• Updated • 26.4k
• 43
internlm/internlm2-chat-7b
Text Generation
• Updated • 81.3k
• 83
internlm/internlm2-base-20b
Text Generation
• Updated • 17.2k
• 8
Text Generation
• Updated • 21.2k
• 59
internlm/internlm2-chat-20b
Text Generation
• 20B • Updated • 20.8k
• 88
YeungNLP/firefly-pretrain-dataset
Viewer
• Updated • 2.46M • 629
• 42
9B • Updated • 216k
• 706
14B • Updated • 84.6k
• 267
9B • Updated • 10.9k
• 201
Text Generation
• 9B • Updated • 11.4k
• 145
Text Generation
• 685B • Updated • 4.02M
• • 13.3k
Text Generation
• 8B • Updated • 13
• • 32
Text Generation
• 73B • Updated • 16
• • 32
Image-Text-to-Text
• 1B • Updated • 107k
• 96
Image-Text-to-Text
• Updated • 986
• 60
Image-Text-to-Text
• 5B • Updated • 58.7k
• 63
Image-Text-to-Text
• 9B • Updated • 45.6k
• 75
Image-Text-to-Text
• 16B • Updated • 33
• 98
Image-Text-to-Text
• 35B • Updated • 520
• 142
deepseek-ai/DeepSeek-R1-Zero
Text Generation
• 685B • Updated • 7.75k
• 957
deepseek-ai/DeepSeek-R1-Distill-Qwen-32B
Text Generation
• 33B • Updated • 729k
• • 1.56k
deepseek-ai/DeepSeek-R1-Distill-Llama-70B
Text Generation
• 71B • Updated • 130k
• • 773
deepseek-ai/DeepSeek-R1-Distill-Llama-8B
Text Generation
• 8B • Updated • 327k
• • 861
deepseek-ai/DeepSeek-R1-Distill-Qwen-7B
Text Generation
• 8B • Updated • 625k
• • 833
deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B
Text Generation
• 2B • Updated • 616k
• • 1.51k
Text Generation
• 685B • Updated • 1.22M
• • 4.07k
deepseek-ai/DeepSeek-V3-Base
685B • Updated • 11.5k
• 1.7k
deepseek-ai/deepseek-math-7b-instruct
Text Generation
• Updated • 11.1k
• 152
deepseek-ai/deepseek-math-7b-base
Text Generation
• Updated • 3.68k
• 89
deepseek-ai/deepseek-math-7b-rl
Text Generation
• 7B • Updated • 2.95k
• 94
Text Generation
• 33B • Updated • 62.9k
• • 2.92k
Qwen/Qwen2.5-14B-Instruct-1M
Text Generation
• 15B • Updated • 19.3k
• • 338
Qwen/Qwen2.5-7B-Instruct-1M
Text Generation
• 8B • Updated • 75.9k
• • 370
qihoo360/TinyR1-32B-Preview
Text Generation
• 33B • Updated • 76
• • 324
Text Generation
• Updated • 7.05M
• • 693
Text Generation
• Updated • 2.37M
• • 398
Text Generation
• 8B • Updated • 11.7M
• • 1.09k
Text Generation
• Updated • 8.03M
• • 613
Text Generation
• 2B • Updated • 3.61M
• • 473
Text Generation
• 0.8B • Updated • 18.2M
• • 1.25k