LLMs
updated
ContextualAI/Contextual_KTO_Mistral_PairRM
Text Generation
• 7B • Updated • 128
• 32
snorkelai/Snorkel-Mistral-PairRM-DPO
Text Generation
• Updated • 712
• 108
state-spaces/mamba-2.8b-hf
Text Generation
• 3B • Updated • 10.6k
• 111
h2oai/h2o-danube-1.8b-base
Text Generation
• 2B • Updated • 176
• 43
Text Generation
• 9B • Updated • 8.99k
• 187
NousResearch/Genstruct-7B
Text Generation
• 7B • Updated • 70
• 403
AetherResearch/Cerebrum-1.0-7b
Text Generation
• 7B • Updated • 27
• • 51
abacusai/Liberated-Qwen1.5-72B
Text Generation
• Updated • 530
• 101
NousResearch/Hermes-2-Pro-Mistral-7B-GGUF
7B • Updated • 4.11k
• 246
Crystalcareai/Gemma-7b-Fixed
Text Generation
• 9B • Updated • 91
• 3
openchat/openchat-3.5-0106-gemma
Text Generation
• 9B • Updated • 4.09k
• 57
openchat/openchat-3.5-0106
Text Generation
• Updated • 13.1k
• 359
HuggingFaceH4/zephyr-7b-gemma-sft-v0.1
Text Generation
• 9B • Updated • 150
• 12
HuggingFaceH4/zephyr-7b-gemma-v0.1
Text Generation
• Updated • 149
• 124
ibm-research/merlinite-7b
Text Generation
• 7B • Updated • 215
• 105
Text Generation
• Updated • 175
• 41
316B • Updated • 18.2k
• 70
Feature Extraction
• 7B • Updated • 13
• 21
Feature Extraction
• 7B • Updated • 4
• 7
Text Generation
• Updated • 69
• 6
Text Generation
• 9B • Updated • 3.23k
• 252
CohereLabs/c4ai-command-r-plus
Text Generation
• Updated • 2.81k
• 1.78k
h2oai/h2o-danube2-1.8b-base
Text Generation
• 2B • Updated • 395
• 47
Text Generation
• 9B • Updated • 13.9k
• 275
stabilityai/stablelm-2-12b
Text Generation
• 12B • Updated • 2.04k
• 120
stabilityai/stablelm-2-12b-chat
Text Generation
• Updated • 283
• 88
google/recurrentgemma-2b-it
Text Generation
• Updated • 3.03k
• 111
Text Generation
• 3B • Updated • 6.52k
• 95
mistral-community/Mixtral-8x22B-v0.1
Text Generation
• 141B • Updated • 203
• 671
meta-llama/Meta-Llama-3-8B
Text Generation
• 8B • Updated • 3.35M
• • 6.5k
meta-llama/Meta-Llama-3-8B-Instruct
Text Generation
• 8B • Updated • 1.42M
• • 4.44k
meta-llama/Meta-Llama-3-70B
Text Generation
• 71B • Updated • 284k
• • 874
meta-llama/Meta-Llama-3-70B-Instruct
Text Generation
• 71B • Updated • 81.4k
• • 1.51k
Snowflake/snowflake-arctic-instruct
Text Generation
• Updated • 13.1k
• 360
nvidia/Llama3-ChatQA-1.5-8B
Text Generation
• Updated • 11.1k
• 555
Text Generation
• 236B • Updated • 18.4k
• 333
deepseek-ai/DeepSeek-V2-Chat
Text Generation
• 236B • Updated • 11.6k
• 461
Text Generation
• 7B • Updated • 2.65k
• 14
Text Generation
• 11B • Updated • 5.2k
• 218
Text Generation
• 6B • Updated • 12.4k
• 31
Text Generation
• 6B • Updated • 6.39k
• 41
Text Generation
• 9B • Updated • 13.3k
• 52
Text Generation
• 9B • Updated • 18.8k
• 148
Text Generation
• 34B • Updated • 8.35k
• 48
Text Generation
• 34B • Updated • 13.3k
• 276
Fugaku-LLM/Fugaku-LLM-13B
Text Generation
• Updated • 21
• 130
Fugaku-LLM/Fugaku-LLM-13B-instruct
Text Generation
• 13B • Updated • 45
• 28
NousResearch/Hermes-2-Theta-Llama-3-8B
Text Generation
• 8B • Updated • 10.7k
• • 204
prometheus-eval/prometheus-7b-v2.0
Text Generation
• 7B • Updated • 59.3k
• 103
prometheus-eval/prometheus-8x7b-v2.0
Text Generation
• 47B • Updated • 1.11k
• 49
microsoft/Phi-3-medium-4k-instruct
Text Generation
• 14B • Updated • 11.9k
• 225
microsoft/Phi-3-medium-128k-instruct
Text Generation
• Updated • 4.96k
• 387
microsoft/Phi-3-small-128k-instruct
Text Generation
• 7B • Updated • 1.27k
• 181
microsoft/Phi-3-small-8k-instruct
Text Generation
• 7B • Updated • 19.9k
• 175
mistralai/Mistral-7B-v0.3
7B • Updated • 310k
• 572
mistralai/Mistral-7B-Instruct-v0.3
7B • Updated • 2.17M
• 2.48k
nvidia/Llama3-ChatQA-1.5-70B
Text Generation
• Updated • 233
• • 333
Text Generation
• 8B • Updated • 10.3k
• 430
Text Generation
• Updated • 4k
• 291
Text Generation
• 73B • Updated • 42k
• • 718
Text Generation
• 73B • Updated • 31.3k
• • 200
nvidia/Nemotron-4-340B-Base
Updated • 1.06k
• 147
nvidia/Nemotron-4-340B-Instruct
Updated • 3.2k
• 694
UCLA-AGI/Llama-3-Instruct-8B-SPPO-Iter3
Text Generation
• 8B • Updated • 8.33k
• • 83
instruction-pretrain/instruction-synthesizer
Text Generation
• 7B • Updated • 31
• 79
Text Generation
• Updated • 60.2k
• • 695
Text Generation
• 9B • Updated • 266k
• • 781
Text Generation
• Updated • 11.3k
• 210
Text Generation
• 27B • Updated • 333k
• 561
deepseek-ai/DeepSeek-V2-Chat-0628
Text Generation
• 236B • Updated • 3.99k
• 177
mistralai/Mistral-Nemo-Base-2407
12B • Updated • 75.8k
• 342
mistralai/Mistral-Nemo-Instruct-2407
Updated • 124k
• 1.66k
mistralai/Mamba-Codestral-7B-v0.1
7B • Updated • 29.8k
• 613
mistralai/Mathstral-7B-v0.1
7B • Updated • 14.4k
• 241
HuggingFaceTB/SmolLM-135M
Text Generation
• 0.1B • Updated • 171k
• 254
HuggingFaceTB/SmolLM-360M
Text Generation
• Updated • 28.5k
• 70
HuggingFaceTB/SmolLM-1.7B
Text Generation
• 2B • Updated • 64.8k
• 181
HuggingFaceTB/SmolLM-135M-Instruct
Text Generation
• 0.1B • Updated • 103k
• 134
HuggingFaceTB/SmolLM-360M-Instruct
Text Generation
• 0.4B • Updated • 9.79k
• 84
HuggingFaceTB/SmolLM-1.7B-Instruct
Text Generation
• 2B • Updated • 13.8k
• 117
7B • Updated • 78
• 834
Text Generation
• Updated • 2.42k
• 135
Text Generation
• 73B • Updated • 91
• 51
Text Generation
• Updated • 399
• • 86
Feature Extraction
• 8B • Updated • 809
• 15
Text Generation
• Updated • 14.3k
• 69
meta-llama/Llama-Guard-3-8B
Text Generation
• 8B • Updated • 94.8k
• • 283
meta-llama/Llama-3.1-8B-Instruct
Text Generation
• 8B • Updated • 8.34M
• • 5.64k
Text Generation
• 8B • Updated • 1.4M
• • 2.13k
meta-llama/Prompt-Guard-86M
Text Classification
• 0.3B • Updated • 23.9k
• • 318
meta-llama/Llama-3.1-405B-Instruct
Text Generation
• 406B • Updated • 168k
• 593
meta-llama/Llama-3.1-405B
Text Generation
• 406B • Updated • 380k
• 965
Text Generation
• Updated • 529k
• 636
Text Generation
• 3B • Updated • 374k
• • 1.31k
Text Generation
• 3B • Updated • 2.25k
• 79
internlm/internlm2_5-20b-chat
Text Generation
• 20B • Updated • 547
• 93
internlm/internlm2_5-7b-chat
Text Generation
• 8B • Updated • 43.8k
• 200
internlm/internlm2_5-7b-chat-1m
Text Generation
• 8B • Updated • 2.43k
• 72
internlm/internlm2_5-1_8b-chat
Text Generation
• 2B • Updated • 1.93k
• 25
Text Generation
• 20B • Updated • 81
• 17
Text Generation
• 8B • Updated • 3.39k
• 18
internlm/internlm2_5-1_8b
Text Generation
• Updated • 1.4k
• 24
LGAI-EXAONE/EXAONE-3.0-7.8B-Instruct
Text Generation
• Updated • 27.6k
• 415
Image-Text-to-Text
• Updated • 128k
• 1.04k
microsoft/Phi-3.5-mini-instruct
Text Generation
• 4B • Updated • 916k
• 965
microsoft/Phi-3.5-MoE-instruct
Text Generation
• Updated • 94.4k
• 571
Audio-Text-to-Text
• Updated • 6.38k
• 165
Qwen/Qwen2-Audio-7B-Instruct
Audio-Text-to-Text
• Updated • 361k
• 526
1bitLLM/bitnet_b1_58-large
Text Generation
• 0.7B • Updated • 1.46k
• 119
SpectraSuite/TriLM_3.9B_Unpacked
Text Generation
• 4B • Updated • 11
• 13
deepseek-ai/DeepSeek-V2.5
Text Generation
• 236B • Updated • 5.8k
• 733
upstage/solar-pro-preview-pretrained
Text Generation
• 22B • Updated • 61
Text Generation
• 0.5B • Updated • 1.83M
• 388
Qwen/Qwen2.5-0.5B-Instruct
Text Generation
• 0.5B • Updated • 5.61M
• 488
Text Generation
• 2B • Updated • 734k
• • 172
Qwen/Qwen2.5-1.5B-Instruct
Text Generation
• 2B • Updated • 9.63M
• • 650
Text Generation
• 3B • Updated • 480k
• 174
Text Generation
• 3B • Updated • 7.62M
• 427
Text Generation
• 8B • Updated • 1.02M
• • 266
Text Generation
• 8B • Updated • 17.4M
• • 1.17k
Text Generation
• 15B • Updated • 144k
• • 146
Qwen/Qwen2.5-14B-Instruct
Text Generation
• Updated • 1.31M
• • 325
Text Generation
• 33B • Updated • 1.68M
• • 173
Qwen/Qwen2.5-32B-Instruct
Text Generation
• Updated • 4.26M
• • 341
Text Generation
• 73B • Updated • 58.1k
• • 94
Qwen/Qwen2.5-72B-Instruct
Text Generation
• 73B • Updated • 691k
• • 923
Text Generation
• 1B • Updated • 1.81M
• 2.35k
Text Generation
• 3B • Updated • 1.17M
• 718
meta-llama/Llama-3.2-1B-Instruct
Text Generation
• 1B • Updated • 4.11M
• • 1.34k
meta-llama/Llama-3.2-3B-Instruct
Text Generation
• 3B • Updated • 7.66M
• • 2.07k
meta-llama/Llama-Guard-3-1B
Text Generation
• 1B • Updated • 64.4k
• 103
Dongwei/Rationalyst_reasoning_datasets
Text Generation
• 8B • Updated • 35
• 4
7B • Updated • 196
• 114
arcee-ai/SuperNova-Medius
Text Generation
• Updated • 104
• • 218
ibm-granite/granite-3.0-8b-instruct
Text Generation
• Updated • 27.3k
• 206
ibm-granite/granite-3.0-8b-base
Text Generation
• 8B • Updated • 1.79k
• 26
ibm-granite/granite-3.0-2b-instruct
Text Generation
• 3B • Updated • 6.35k
• 49
ibm-granite/granite-3.0-2b-base
Text Generation
• 3B • Updated • 3.22k
• 24
ibm-granite/granite-3.0-3b-a800m-instruct
Text Generation
• 3B • Updated • 2.44k
• 20
ibm-granite/granite-3.0-3b-a800m-base
Text Generation
• 3B • Updated • 1.49k
• 5
ibm-granite/granite-3.0-1b-a400m-instruct
Text Generation
• 1B • Updated • 1.26k
• 20
ibm-granite/granite-3.0-1b-a400m-base
Text Generation
• 1B • Updated • 7.63k
• 6
Text Generation
• 3B • Updated • 74
• 20