LLMs
ContextualAI/Contextual_KTO_Mistral_PairRM • Text Generation • 7B • 59 • 32
snorkelai/Snorkel-Mistral-PairRM-DPO • Text Generation • 581 • 108
state-spaces/mamba-2.8b-hf • Text Generation • 3B • 5.28k • 111
h2oai/h2o-danube-1.8b-base • Text Generation • 2B • 68 • 43
Text Generation • 9B • 9.47k • 187
NousResearch/Genstruct-7B • Text Generation • 7B • 119 • 402
AetherResearch/Cerebrum-1.0-7b • Text Generation • 7B • 44 • 51
abacusai/Liberated-Qwen1.5-72B • Text Generation • 85 • 100
NousResearch/Hermes-2-Pro-Mistral-7B-GGUF • 7B • 3.68k • 243
Crystalcareai/Gemma-7b-Fixed • Text Generation • 9B • 11 • 3
openchat/openchat-3.5-0106-gemma • Text Generation • 9B • 2.29k • 57
openchat/openchat-3.5-0106 • Text Generation • 7B • 11.7k • 360
HuggingFaceH4/zephyr-7b-gemma-sft-v0.1 • Text Generation • 9B • 111 • 12
HuggingFaceH4/zephyr-7b-gemma-v0.1 • Text Generation • 9B • 237 • 124
ibm-research/merlinite-7b • Text Generation • 7B • 51 • 105
Text Generation • 130 • 41
316B • 39.3k • 70
Feature Extraction • 7B • 10 • 21
Feature Extraction • 7B • 5 • 7
Text Generation • 55 • 6
Text Generation • 9B • 1.94k • 251
CohereLabs/c4ai-command-r-plus • Text Generation • 104B • 2.25k • 1.77k
h2oai/h2o-danube2-1.8b-base • Text Generation • 2B • 154 • 47
Text Generation • 9B • 19k • 276
stabilityai/stablelm-2-12b • Text Generation • 12B • 338 • 120
stabilityai/stablelm-2-12b-chat • Text Generation • 12B • 248 • 88
google/recurrentgemma-2b-it • Text Generation • 3B • 2.64k • 111
Text Generation • 3B • 3.63k • 94
mistral-community/Mixtral-8x22B-v0.1 • Text Generation • 141B • 251 • 672
meta-llama/Meta-Llama-3-8B • Text Generation • 8B • 1.86M • 6.46k
meta-llama/Meta-Llama-3-8B-Instruct • Text Generation • 8B • 1.46M • 4.38k
meta-llama/Meta-Llama-3-70B • Text Generation • 71B • 198k • 871
meta-llama/Meta-Llama-3-70B-Instruct • Text Generation • 71B • 58.8k • 1.51k
Snowflake/snowflake-arctic-instruct • Text Generation • 479B • 7.04k • 360
nvidia/Llama3-ChatQA-1.5-8B • Text Generation • 8B • 8.91k • 553
Text Generation • 236B • 3.8k • 333
deepseek-ai/DeepSeek-V2-Chat • Text Generation • 236B • 8.22k • 460
Text Generation • 7B • 1.01k • 14
Text Generation • 11B • 7.76k • 218
Text Generation • 6B • 11k • 31
Text Generation • 6B • 6.38k • 41
Text Generation • 9B • 10.8k • 52
Text Generation • 9B • 18k • 147
Text Generation • 34B • 7.22k • 48
Text Generation • 34B • 10.5k • 273
Fugaku-LLM/Fugaku-LLM-13B • Text Generation • 13B • 130
Fugaku-LLM/Fugaku-LLM-13B-instruct • Text Generation • 13B • 75 • 28
NousResearch/Hermes-2-Theta-Llama-3-8B • Text Generation • 8B • 8.26k • 204
prometheus-eval/prometheus-7b-v2.0 • Text Generation • 7B • 17.8k • 100
prometheus-eval/prometheus-8x7b-v2.0 • Text Generation • 47B • 338 • 49
microsoft/Phi-3-medium-4k-instruct • Text Generation • 14B • 8.03k • 224
microsoft/Phi-3-medium-128k-instruct • Text Generation • 14B • 11.7k • 387
microsoft/Phi-3-small-128k-instruct • Text Generation • 7B • 893 • 181
microsoft/Phi-3-small-8k-instruct • Text Generation • 7B • 12.6k • 175
mistralai/Mistral-7B-v0.3 • 7B • 74.1k • 560
mistralai/Mistral-7B-Instruct-v0.3 • 7B • 1.12M • 2.4k
nvidia/Llama3-ChatQA-1.5-70B • Text Generation • 71B • 196 • 333
Text Generation • 8B • 14.7k • 428
Text Generation • 35B • 3.26k • 291
Text Generation • 73B • 28.5k • 719
Text Generation • 73B • 32k • 200
nvidia/Nemotron-4-340B-Base • 199 • 147
nvidia/Nemotron-4-340B-Instruct • 78 • 693
UCLA-AGI/Llama-3-Instruct-8B-SPPO-Iter3 • Text Generation • 8B • 7.68k • 83
instruction-pretrain/instruction-synthesizer • Text Generation • 7B • 47 • 79
Text Generation • 9B • 54.9k • 689
Text Generation • 9B • 136k • 766
Text Generation • 27B • 6.51k • 210
Text Generation • 27B • 374k • 559
deepseek-ai/DeepSeek-V2-Chat-0628 • Text Generation • 236B • 2.92k • 177
mistralai/Mistral-Nemo-Base-2407 • 12B • 22.1k • 340
mistralai/Mistral-Nemo-Instruct-2407 • 12B • 80.6k • 1.65k
mistralai/Mamba-Codestral-7B-v0.1 • 7B • 22.8k • 610
mistralai/Mathstral-7B-v0.1 • 7B • 5.1k • 237
HuggingFaceTB/SmolLM-135M • Text Generation • 0.1B • 237k • 244
HuggingFaceTB/SmolLM-360M • Text Generation • 0.4B • 35.4k • 69
HuggingFaceTB/SmolLM-1.7B • Text Generation • 2B • 61.9k • 179
HuggingFaceTB/SmolLM-135M-Instruct • Text Generation • 0.1B • 78.6k • 131
HuggingFaceTB/SmolLM-360M-Instruct • Text Generation • 0.4B • 23.8k • 83
HuggingFaceTB/SmolLM-1.7B-Instruct • Text Generation • 2B • 5.85k • 117
7B • 607 • 833
Text Generation • 959 • 136
Text Generation • 73B • 582 • 51
Text Generation • 8B • 313 • 87
Feature Extraction • 8B • 64 • 15
Text Generation • 8.84k • 69
meta-llama/Llama-Guard-3-8B • Text Generation • 8B • 41.4k • 264
meta-llama/Llama-3.1-8B-Instruct • Text Generation • 8B • 5.59M • 5.44k
Text Generation • 8B • 1.14M • 2.06k
meta-llama/Prompt-Guard-86M • Text Classification • 0.3B • 24.7k • 308
meta-llama/Llama-3.1-405B-Instruct • Text Generation • 406B • 143k • 591
meta-llama/Llama-3.1-405B • Text Generation • 406B • 496k • 958
Text Generation • 3B • 154k • 623
Text Generation • 3B • 310k • 1.28k
Text Generation • 3B • 283 • 78
internlm/internlm2_5-20b-chat • Text Generation • 20B • 630 • 92
internlm/internlm2_5-7b-chat • Text Generation • 8B • 26.9k • 199
internlm/internlm2_5-7b-chat-1m • Text Generation • 8B • 230 • 72
internlm/internlm2_5-1_8b-chat • Text Generation • 2B • 1.67k • 25
Text Generation • 20B • 75 • 17
Text Generation • 8B • 3.83k • 18
internlm/internlm2_5-1_8b • Text Generation • 298 • 24
LGAI-EXAONE/EXAONE-3.0-7.8B-Instruct • Text Generation • 8B • 17.1k • 416
Image-Text-to-Text • 8B • 89k • 1.03k
microsoft/Phi-3.5-mini-instruct • Text Generation • 4B • 380k • 963
microsoft/Phi-3.5-MoE-instruct • Text Generation • 42B • 104k • 570
Audio-Text-to-Text • 8B • 26.4k • 158
Qwen/Qwen2-Audio-7B-Instruct • Audio-Text-to-Text • 8B • 573k • 519
1bitLLM/bitnet_b1_58-large • Text Generation • 0.7B • 969 • 113
SpectraSuite/TriLM_3.9B_Unpacked • Text Generation • 4B • 8 • 13
deepseek-ai/DeepSeek-V2.5 • Text Generation • 236B • 9.88k • 733
upstage/solar-pro-preview-pretrained • Text Generation • 22B • 60
Text Generation • 0.5B • 1.08M • 371
Qwen/Qwen2.5-0.5B-Instruct • Text Generation • 0.5B • 4.26M • 462
Text Generation • 2B • 420k • 163
Qwen/Qwen2.5-1.5B-Instruct • Text Generation • 2B • 6.93M • 613
Text Generation • 3B • 231k • 168
Text Generation • 3B • 9.86M • 402
Text Generation • 8B • 1.08M • 263
Text Generation • 8B • 10.3M • 1.07k
Text Generation • 15B • 131k • 142
Qwen/Qwen2.5-14B-Instruct • Text Generation • 15B • 1.89M • 310
Text Generation • 33B • 100k • 172
Qwen/Qwen2.5-32B-Instruct • Text Generation • 33B • 4.37M • 327
Text Generation • 73B • 20.2k • 87
Qwen/Qwen2.5-72B-Instruct • Text Generation • 73B • 265k • 907
Text Generation • 1B • 2.42M • 2.29k
Text Generation • 3B • 739k • 696
meta-llama/Llama-3.2-1B-Instruct • Text Generation • 1B • 2.91M • 1.29k
meta-llama/Llama-3.2-3B-Instruct • Text Generation • 3B • 2.18M • 1.98k
meta-llama/Llama-Guard-3-1B • Text Generation • 1B • 83.6k • 100
Dongwei/Rationalyst_reasoning_datasets • Text Generation • 8B • 60 • 4
7B • 59 • 114
arcee-ai/SuperNova-Medius • Text Generation • 15B • 118 • 218
ibm-granite/granite-3.0-8b-instruct • Text Generation • 8B • 16.7k • 205
ibm-granite/granite-3.0-8b-base • Text Generation • 8B • 1.79k • 25
ibm-granite/granite-3.0-2b-instruct • Text Generation • 3B • 4.1k • 47
ibm-granite/granite-3.0-2b-base • Text Generation • 3B • 2.66k • 23
ibm-granite/granite-3.0-3b-a800m-instruct • Text Generation • 3B • 1.4k • 20
ibm-granite/granite-3.0-3b-a800m-base • Text Generation • 3B • 1.18k • 5
ibm-granite/granite-3.0-1b-a400m-instruct • Text Generation • 1B • 214 • 20
ibm-granite/granite-3.0-1b-a400m-base • Text Generation • 1B • 2.82k • 6
Text Generation • 3B • 114 • 20