Causal LMs, seq2seq models
updated
0.8B • Updated • 491k
• 888
Text Generation
• 9B • Updated • 18.3k
• • 1.25k
13B • Updated • 4.68k
• 665
Narrativa/mT5-base-finetuned-tydiQA-question-generation
Updated • 10
• 16
Text Generation
• 7B • Updated • 9.07k
• • 118
Text Generation
• Updated • 959
• • 668
tiiuae/falcon-40b-instruct
Text Generation
• Updated • 10.3k
• 1.18k
prometheus-eval/prometheus-13b-v1.0
Text Generation
• Updated • 340
• • 145
mistralai/Mixtral-8x7B-Instruct-v0.1
47B • Updated • 527k
• 4.7k
HuggingFaceH4/zephyr-7b-gemma-v0.1
Text Generation
• 9B • Updated • 128
• • 125
CohereLabs/c4ai-command-r-v01
Text Generation
• 35B • Updated • 25k
• 1.11k
HuggingFaceH4/starchat2-15b-v0.1
16B • Updated • 51
• 112
Text Generation
• 14B • Updated • 11.7k
• 112
CohereLabs/c4ai-command-r-plus
Text Generation
• 104B • Updated • 4.9k
• 1.8k
mistral-community/Mixtral-8x22B-v0.1
Text Generation
• 141B • Updated • 92
• 672
mistralai/Codestral-22B-v0.1
22B • Updated • 17.2k
• 1.33k
mistralai/Mistral-7B-Instruct-v0.3
7B • Updated • 3.23M
• 2.67k
Text Generation
• 8B • Updated • 41.9k
• • 171
meta-llama/Llama-3.1-8B-Instruct
Text Generation
• 8B • Updated • 10M
• • 6.19k
mistralai/Mistral-Large-Instruct-2407
123B • Updated • 4.57k
• 863
RLHFlow/Llama3.1-8B-PRM-Deepseek-Data
Text Generation
• 8B • Updated • 4.41k
• • 39
deepseek-ai/DeepSeek-V3-Base
685B • Updated • 8.09k
• 1.7k
deepseek-ai/DeepSeek-R1-Distill-Llama-8B
Text Generation
• 8B • Updated • 288k
• • 870
Text Generation
• 3B • Updated • 667k
• 981
allenai/Llama-3.1-Tulu-3-405B
Text Generation
• Updated • 740
• 112