Causal LMs, seq2seq models
updated
0.8B • Updated • 500k
• 888
Text Generation
• 9B • Updated • 18.9k
• 1.25k
13B • Updated • 4.22k
• 665
Narrativa/mT5-base-finetuned-tydiQA-question-generation
Updated • 10
• 16
Text Generation
• 7B • Updated • 9.24k
• • 118
Text Generation
• Updated • 933
• • 668
tiiuae/falcon-40b-instruct
Text Generation
• Updated • 9.08k
• 1.18k
prometheus-eval/prometheus-13b-v1.0
Text Generation
• Updated • 320
• • 145
mistralai/Mixtral-8x7B-Instruct-v0.1
47B • Updated • 525k
• 4.7k
HuggingFaceH4/zephyr-7b-gemma-v0.1
Text Generation
• 9B • Updated • 130
• • 125
CohereLabs/c4ai-command-r-v01
Text Generation
• 35B • Updated • 25.1k
• 1.11k
HuggingFaceH4/starchat2-15b-v0.1
16B • Updated • 238
• 112
Text Generation
• 14B • Updated • 12k
• 112
CohereLabs/c4ai-command-r-plus
Text Generation
• 104B • Updated • 4.82k
• 1.8k
mistral-community/Mixtral-8x22B-v0.1
Text Generation
• 141B • Updated • 86
• 672
mistralai/Codestral-22B-v0.1
22B • Updated • 16.7k
• 1.33k
mistralai/Mistral-7B-Instruct-v0.3
7B • Updated • 3.42M
• 2.67k
Text Generation
• 8B • Updated • 45k
• • 171
meta-llama/Llama-3.1-8B-Instruct
Text Generation
• 8B • Updated • 9.87M
• • 6.19k
mistralai/Mistral-Large-Instruct-2407
123B • Updated • 4.61k
• 863
RLHFlow/Llama3.1-8B-PRM-Deepseek-Data
Text Generation
• 8B • Updated • 4.26k
• • 39
deepseek-ai/DeepSeek-V3-Base
685B • Updated • 9.28k
• 1.7k
deepseek-ai/DeepSeek-R1-Distill-Llama-8B
Text Generation
• 8B • Updated • 293k
• • 870
Text Generation
• 3B • Updated • 680k
• 981
allenai/Llama-3.1-Tulu-3-405B
Text Generation
• Updated • 725
• 112