Small Language Models
facebook/opt-iml-max-1.3b
Text Generation
• Updated • 2.84k
• 43
Text Generation
• Updated • 25.2k
• 87
togethercomputer/RedPajama-INCITE-Chat-3B-v1
Text Generation
• Updated • 1.25k
• 152
Text Generation
• 3B • Updated • 324k
• 503
Text Generation
• 3B • Updated • 70k
• 33
cerebras/Cerebras-GPT-2.7B
Text Generation
• Updated • 1.31k
• 46
M4-ai/TinyMistral-6x248M-Instruct
Text Generation
• 1B • Updated • 45
• 11
M4-ai/NeuralReyna-Mini-1.8B-v0.3
Text Generation
• 2B • Updated • 95
• 11
stabilityai/stablelm-2-zephyr-1_6b
Text Generation
• 2B • Updated • 4.13k
• 187
stabilityai/stable-code-instruct-3b
Text Generation
• 3B • Updated • 2.07k
• 186
stabilityai/stablelm-zephyr-3b
Text Generation
• 3B • Updated • 33.2k
• 261
TinyLlama/TinyLlama-1.1B-Chat-v1.0
Text Generation
• 1B • Updated • 2.82M
• 1.58k
Text Generation
• 1B • Updated • 38.7k
• 30
Text Generation
• 4B • Updated • 18.1k
• 46
Text Generation
• 2B • Updated • 4.53M
• 162
Qwen/Qwen2.5-Coder-1.5B-Instruct
Text Generation
• 2B • Updated • 341k
• 121
Text Generation
• 3B • Updated • 8.13M
• 455
Text Generation
• 3B • Updated • 78.4k
• 878
Text Generation
• 3B • Updated • 167k
• 173
Text Generation
• 3B • Updated • 26.4k
• 98
Text Generation
• Updated • 88
• 24
Text Generation
• 1B • Updated • 8.62k
• 220
Text Generation
• 3B • Updated • 378k
• 1.35k
Text Generation
• 1B • Updated • 68.8k
• 1.36k
Text Generation
• 3B • Updated • 529k
• 3.45k
ministral/Ministral-3b-instruct
Text Generation
• 3B • Updated • 9.08k
• 86
HuggingFaceTB/SmolLM-1.7B-Instruct
Text Generation
• 2B • Updated • 13.4k
• 117
h2oai/h2o-danube-1.8b-chat
Text Generation
• 2B • Updated • 201
• 55
h2oai/h2o-danube2-1.8b-chat
Text Generation
• 2B • Updated • 851
• 62
h2oai/h2o-danube3-4b-chat
Text Generation
• 4B • Updated • 746
• 68
h2oai/h2o-danube3.1-4b-chat
Text Generation
• 4B • Updated • 626
• 5
Text Generation
• 1B • Updated • 469
• 42
Text Generation
• 6B • Updated • 48.2k
• 70
Text Generation
• 6B • Updated • 6.82k
• 41
Updated • 97
• 258
Updated • 133k
• 1.16k
Text Generation
• 4B • Updated • 24.1k
• 12
meta-llama/Llama-3.2-3B-Instruct
Text Generation
• 3B • Updated • 2.42M
• 2.13k
NousResearch/Hermes-3-Llama-3.2-3B
Text Generation
• 3B • Updated • 7.36k
• 179
ibm-granite/granite-3b-code-instruct-2k
Text Generation
• 3B • Updated • 8.18k
• 39
ibm-granite/granite-3.0-2b-instruct
Text Generation
• 3B • Updated • 7.04k
• 49
HuggingFaceTB/SmolLM2-1.7B
Text Generation
• 2B • Updated • 159k
• 150
deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B
Text Generation
• 2B • Updated • 536k
• 1.5k
apple/OpenELM-3B-Instruct
Text Generation
• 3B • Updated • 3.17k
• 339
internlm/internlm2-chat-1_8b
Text Generation
• 2B • Updated • 5.89k
• 36
internlm/internlm2_5-1_8b-chat
Text Generation
• 2B • Updated • 1.69k
• 26
agentica-org/DeepScaleR-1.5B-Preview
Text Generation
• 2B • Updated • 8.58k
• 583
microsoft/Phi-3-mini-128k-instruct
Text Generation
• Updated • 246k
• 1.7k
microsoft/Phi-4-mini-instruct
Text Generation
• Updated • 1.53M
• 739
Text Generation
• 1.0B • Updated • 657k
• 950
Text Generation
• Updated • 178
• 122
ibm-granite/granite-3.3-2b-instruct
Text Generation
• Updated • 32.4k
• 85
Text Generation
• 4B • Updated • 531
• 501
Text Generation
• 3B • Updated • 181k
• 952
Qwen/Qwen3-4B-Instruct-2507
Text Generation
• 4B • Updated • 11M
• 841
Qwen/Qwen3-4B-Thinking-2507
Text Generation
• 4B • Updated • 502k
• 585
Text Generation
• 3B • Updated • 6.41k
• 188
ibm-granite/granite-4.0-h-micro
Text Generation
• 3B • Updated • 6.58k
• 144
Alibaba-Apsara/DASD-4B-Thinking
Text Generation
• Updated • 310
• 236
mistralai/Ministral-3-3B-Reasoning-2512
4B • Updated • 51.6k
• 115
mistralai/Ministral-3-3B-Instruct-2512
Updated • 258k
• 235
Text Generation
• 2B • Updated • 2.88k
• 227
Nanbeige/Nanbeige4-3B-Thinking-2511
Text Generation
• 4B • Updated • 993
• 206
Text Generation
• 4B • Updated • 233k
• 1.11k
LiquidAI/LFM2.5-1.2B-Instruct
Text Generation
• 1B • Updated • 531k
• 586
LiquidAI/LFM2.5-1.2B-Thinking
Text Generation
• 1B • Updated • 31.3k
• 345
Text Generation
• 4B • Updated • 447
• 78
janhq/Jan-v3-4B-base-instruct
Text Generation
• 4B • Updated • 90
• 62
CohereLabs/tiny-aya-global
Text Generation
• 3B • Updated • 20.6k
• 152
Image-Text-to-Text
• 5B • Updated • 7.1M
• 533