Base Model
updated
mistralai/Mistral-Small-3.1-24B-Base-2503
Updated • 5.58k
• 272
Text Generation
• Updated • 282k
• • 167
Text Generation
• 22B • Updated • 7.23M
• • 4.59k
Text Generation
• 685B • Updated • 3.84M
• • 13.3k
Text Generation
• Updated • 10.8k
• 301
baidu/ERNIE-4.5-0.3B-Base-PT
Text Generation
• Updated • 1.64k
• 22
Text Generation
• 1B • Updated • 1.61M
• • 2.38k
Updated • 34.6k
• 1.08k
deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B
Text Generation
• 2B • Updated • 450k
• • 1.5k
baidu/ERNIE-4.5-VL-28B-A3B-Thinking
Image-Text-to-Text
• 30B • Updated • 2.07k
• 537
deepseek-ai/DeepSeek-R1-Zero
Text Generation
• 685B • Updated • 4.41k
• 956
Text Generation
• 9B • Updated • 23.5k
• • 104
Text Generation
• 0.4B • Updated • 21.7k
• 249
Text Generation
• Updated • 338
• 41
microsoft/Phi-4-mini-flash-reasoning
Text Generation
• Updated • 892
• 275
Qwen/Qwen3-VL-2B-Instruct
Image-Text-to-Text
• 2B • Updated • 187M
• 392
deepseek-ai/DeepSeek-V3.2-Exp
Text Generation
• 685B • Updated • 212k
• • 994
tencent/Hunyuan-0.5B-Pretrain
Text Generation
• 0.5B • Updated • 1.56k
• 11
Text Generation
• 7B • Updated • 195k
• 68