Small Language Models
updated
facebook/opt-iml-max-1.3b
Text Generation
• Updated • 960
• 43
Text Generation
• Updated • 35.8k
• 88
togethercomputer/RedPajama-INCITE-Chat-3B-v1
Text Generation
• Updated • 439
• 152
Text Generation
• 3B • Updated • 39.4k
• 503
Text Generation
• 3B • Updated • 40.4k
• 34
cerebras/Cerebras-GPT-2.7B
Text Generation
• Updated • 989
• 47
M4-ai/TinyMistral-6x248M-Instruct
Text Generation
• 1B • Updated • 22
• 11
M4-ai/NeuralReyna-Mini-1.8B-v0.3
Text Generation
• 2B • Updated • 108
• 11
stabilityai/stablelm-2-zephyr-1_6b
Text Generation
• 2B • Updated • 3.26k
• 187
stabilityai/stable-code-instruct-3b
Text Generation
• 3B • Updated • 2.36k
• 187
stabilityai/stablelm-zephyr-3b
Text Generation
• 3B • Updated • 28.2k
• 261
TinyLlama/TinyLlama-1.1B-Chat-v1.0
Text Generation
• 1B • Updated • 2.2M
• • 1.65k
Text Generation
• 1B • Updated • 46.4k
• 30
Text Generation
• 4B • Updated • 16.2k
• 46
Text Generation
• 2B • Updated • 3.79M
• • 162
Qwen/Qwen2.5-Coder-1.5B-Instruct
Text Generation
• 2B • Updated • 781k
• • 131
Text Generation
• 3B • Updated • 7.89M
• • 513
Text Generation
• 3B • Updated • 89.9k
• • 921
Text Generation
• 3B • Updated • 178k
• 173
Text Generation
• 3B • Updated • 34.3k
• • 101
Text Generation
• 3B • Updated • 13
• 24
Text Generation
• 1B • Updated • 15.1k
• • 222
Text Generation
• 3B • Updated • 388k
• • 1.41k
Text Generation
• 1B • Updated • 60.9k
• • 1.36k
Text Generation
• 3B • Updated • 577k
• • 3.47k
ministral/Ministral-3b-instruct
Text Generation
• 3B • Updated • 5.57k
• 86
HuggingFaceTB/SmolLM-1.7B-Instruct
Text Generation
• 2B • Updated • 5.71k
• 119
h2oai/h2o-danube-1.8b-chat
Text Generation
• 2B • Updated • 742
• 55
h2oai/h2o-danube2-1.8b-chat
Text Generation
• 2B • Updated • 466
• 62
h2oai/h2o-danube3-4b-chat
Text Generation
• 4B • Updated • 768
• 68
h2oai/h2o-danube3.1-4b-chat
Text Generation
• 4B • Updated • 8
• 5
Text Generation
• 1B • Updated • 5.93k
• 42
Text Generation
• 6B • Updated • 17.7k
• 70
Text Generation
• 6B • Updated • 5.25k
• 42
Updated • 198
• 258
6B • Updated • 120k
• 1.16k
Text Generation
• 4B • Updated • 791
• 12
meta-llama/Llama-3.2-3B-Instruct
Text Generation
• 3B • Updated • 2.02M
• • 2.28k
NousResearch/Hermes-3-Llama-3.2-3B
Text Generation
• 3B • Updated • 4.79k
• • 181
ibm-granite/granite-3b-code-instruct-2k
Text Generation
• 3B • Updated • 6.05k
• 40
ibm-granite/granite-3.0-2b-instruct
Text Generation
• 3B • Updated • 3.4k
• 49
HuggingFaceTB/SmolLM2-1.7B
Text Generation
• 2B • Updated • 188k
• 153
deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B
Text Generation
• 2B • Updated • 654k
• • 1.53k
apple/OpenELM-3B-Instruct
Text Generation
• 3B • Updated • 1.2k
• 340
internlm/internlm2-chat-1_8b
Text Generation
• 2B • Updated • 5.59k
• 36
internlm/internlm2_5-1_8b-chat
Text Generation
• 2B • Updated • 2.58k
• 26
agentica-org/DeepScaleR-1.5B-Preview
Text Generation
• 2B • Updated • 6.53k
• • 584
microsoft/Phi-3-mini-128k-instruct
Text Generation
• 4B • Updated • 245k
• 1.7k
microsoft/Phi-4-mini-instruct
Text Generation
• 4B • Updated • 666k
• • 784
Text Generation
• 1.0B • Updated • 3.82M
• • 1.02k
Updated • 197
• 122
ibm-granite/granite-3.3-2b-instruct
Text Generation
• 3B • Updated • 24.8k
• 85
Text Generation
• 4B • Updated • 447
• • 503
Text Generation
• 3B • Updated • 657k
• 981
Qwen/Qwen3-4B-Instruct-2507
Text Generation
• 4B • Updated • 5.4M
• • 886
Qwen/Qwen3-4B-Thinking-2507
Text Generation
• 4B • Updated • 593k
• • 600
Text Generation
• 3B • Updated • 8.57k
• 188
ibm-granite/granite-4.0-h-micro
Text Generation
• 3B • Updated • 8.66k
• 147
Alibaba-Apsara/DASD-4B-Thinking
Text Generation
• 4B • Updated • 38
• • 234
mistralai/Ministral-3-3B-Reasoning-2512
4B • Updated • 31.3k
• 116
mistralai/Ministral-3-3B-Instruct-2512
4B • Updated • 1.18M
• 253
Text Generation
• 2B • Updated • 292
• 228
Nanbeige/Nanbeige4-3B-Thinking-2511
Text Generation
• 4B • Updated • 298
• 208
Text Generation
• 4B • Updated • 5.11k
• • 1.14k
LiquidAI/LFM2.5-1.2B-Instruct
Text Generation
• 1B • Updated • 124k
• 620
LiquidAI/LFM2.5-1.2B-Thinking
Text Generation
• 1B • Updated • 48.7k
• 374
Text Generation
• 4B • Updated • 719
• • 79
janhq/Jan-v3-4B-base-instruct
Text Generation
• 4B • Updated • 64
• • 62
CohereLabs/tiny-aya-global
Text Generation
• 3B • Updated • 19.5k
• • 163
Image-Text-to-Text
• 5B • Updated • 8.8M
• • 693