-
deepseek-ai/deepseek-llm-67b-chat
Text Generation • Updated • 3.12k • 206 -
deepseek-ai/deepseek-llm-7b-chat
Text Generation • Updated • 185k • 219 -
deepseek-ai/deepseek-llm-67b-base
Text Generation • Updated • 20.4k • 128 -
deepseek-ai/deepseek-llm-7b-base
Text Generation • Updated • 43.1k • 141
Collections
Discover the best community collections!
Collections trending this week
-
Lewdiculous/Model-Requests
Updated • 43 -
Lewdiculous/CaptainErisNebula-12B-Chimera-v1.1-GGUF-IQ-Imatrix
12B • Updated • 3.14k • 22 -
Lewdiculous/CaptainErisNebula-12B-AOE-v1-GGUF-IQ-Imatrix
12B • Updated • 68 • 5 -
Lewdiculous/DS-R1-Qwen3-8B-ArliAI-RpR-v4-Small-GGUF-IQ-Imatrix
8B • Updated • 1.87k • 16
-
AIRI-Institute/gena-lm-bert-large-t2t
Fill-Mask • 0.4B • Updated • 210 • 10 -
AIRI-Institute/gena-lm-bert-base-t2t
Fill-Mask • 0.1B • Updated • 1.75k • 3 -
AIRI-Institute/gena-lm-bigbird-base-t2t
Fill-Mask • Updated • 231 • 12 -
AIRI-Institute/gena-lm-bert-base-lastln-t2t
Fill-Mask • Updated • 13 • 2
-
Whisper
📉2.76kTranscribe audio files into text
-
Robust Speech Recognition via Large-Scale Weak Supervision
Paper • 2212.04356 • Published • 53 -
openai/whisper-large-v2
Automatic Speech Recognition • 2B • Updated • 70.1k • 1.79k -
openai/whisper-large
Automatic Speech Recognition • 2B • Updated • 84.6k • 542
-
deepseek-ai/deepseek-coder-33b-instruct
Text Generation • 33B • Updated • 7.06k • 566 -
deepseek-ai/deepseek-coder-6.7b-instruct
Text Generation • 7B • Updated • 138k • 490 -
deepseek-ai/deepseek-coder-7b-instruct-v1.5
Text Generation • 7B • Updated • 33.5k • 149 -
deepseek-ai/deepseek-coder-1.3b-instruct
Text Generation • Updated • 65.7k • 160
-
Can Large Language Models Understand Context?
Paper • 2402.00858 • Published • 24 -
OLMo: Accelerating the Science of Language Models
Paper • 2402.00838 • Published • 85 -
Self-Rewarding Language Models
Paper • 2401.10020 • Published • 153 -
SemScore: Automated Evaluation of Instruction-Tuned LLMs based on Semantic Textual Similarity
Paper • 2401.17072 • Published • 25
-
nvidia/parakeet-rnnt-1.1b
Automatic Speech Recognition • Updated • 1.15k • 167 -
nvidia/parakeet-ctc-1.1b
Automatic Speech Recognition • 1B • Updated • 866k • 46 -
nvidia/parakeet-rnnt-0.6b
Automatic Speech Recognition • Updated • 1.83k • 12 -
nvidia/parakeet-ctc-0.6b
Automatic Speech Recognition • 0.6B • Updated • 24k • 26
-
Contrastive Decoding Improves Reasoning in Large Language Models
Paper • 2309.09117 • Published • 40 -
Prometheus: Inducing Fine-grained Evaluation Capability in Language Models
Paper • 2310.08491 • Published • 57 -
Language Models are Hidden Reasoners: Unlocking Latent Reasoning Capabilities via Self-Rewarding
Paper • 2411.04282 • Published • 37 -
Insight-V: Exploring Long-Chain Visual Reasoning with Multimodal Large Language Models
Paper • 2411.14432 • Published • 25
-
deepseek-ai/deepseek-llm-67b-chat
Text Generation • Updated • 3.12k • 206 -
deepseek-ai/deepseek-llm-7b-chat
Text Generation • Updated • 185k • 219 -
deepseek-ai/deepseek-llm-67b-base
Text Generation • Updated • 20.4k • 128 -
deepseek-ai/deepseek-llm-7b-base
Text Generation • Updated • 43.1k • 141
-
deepseek-ai/deepseek-coder-33b-instruct
Text Generation • 33B • Updated • 7.06k • 566 -
deepseek-ai/deepseek-coder-6.7b-instruct
Text Generation • 7B • Updated • 138k • 490 -
deepseek-ai/deepseek-coder-7b-instruct-v1.5
Text Generation • 7B • Updated • 33.5k • 149 -
deepseek-ai/deepseek-coder-1.3b-instruct
Text Generation • Updated • 65.7k • 160
-
Lewdiculous/Model-Requests
Updated • 43 -
Lewdiculous/CaptainErisNebula-12B-Chimera-v1.1-GGUF-IQ-Imatrix
12B • Updated • 3.14k • 22 -
Lewdiculous/CaptainErisNebula-12B-AOE-v1-GGUF-IQ-Imatrix
12B • Updated • 68 • 5 -
Lewdiculous/DS-R1-Qwen3-8B-ArliAI-RpR-v4-Small-GGUF-IQ-Imatrix
8B • Updated • 1.87k • 16
-
Can Large Language Models Understand Context?
Paper • 2402.00858 • Published • 24 -
OLMo: Accelerating the Science of Language Models
Paper • 2402.00838 • Published • 85 -
Self-Rewarding Language Models
Paper • 2401.10020 • Published • 153 -
SemScore: Automated Evaluation of Instruction-Tuned LLMs based on Semantic Textual Similarity
Paper • 2401.17072 • Published • 25
-
AIRI-Institute/gena-lm-bert-large-t2t
Fill-Mask • 0.4B • Updated • 210 • 10 -
AIRI-Institute/gena-lm-bert-base-t2t
Fill-Mask • 0.1B • Updated • 1.75k • 3 -
AIRI-Institute/gena-lm-bigbird-base-t2t
Fill-Mask • Updated • 231 • 12 -
AIRI-Institute/gena-lm-bert-base-lastln-t2t
Fill-Mask • Updated • 13 • 2
-
nvidia/parakeet-rnnt-1.1b
Automatic Speech Recognition • Updated • 1.15k • 167 -
nvidia/parakeet-ctc-1.1b
Automatic Speech Recognition • 1B • Updated • 866k • 46 -
nvidia/parakeet-rnnt-0.6b
Automatic Speech Recognition • Updated • 1.83k • 12 -
nvidia/parakeet-ctc-0.6b
Automatic Speech Recognition • 0.6B • Updated • 24k • 26
-
Contrastive Decoding Improves Reasoning in Large Language Models
Paper • 2309.09117 • Published • 40 -
Prometheus: Inducing Fine-grained Evaluation Capability in Language Models
Paper • 2310.08491 • Published • 57 -
Language Models are Hidden Reasoners: Unlocking Latent Reasoning Capabilities via Self-Rewarding
Paper • 2411.04282 • Published • 37 -
Insight-V: Exploring Long-Chain Visual Reasoning with Multimodal Large Language Models
Paper • 2411.14432 • Published • 25
-
Whisper
📉2.76kTranscribe audio files into text
-
Robust Speech Recognition via Large-Scale Weak Supervision
Paper • 2212.04356 • Published • 53 -
openai/whisper-large-v2
Automatic Speech Recognition • 2B • Updated • 70.1k • 1.79k -
openai/whisper-large
Automatic Speech Recognition • 2B • Updated • 84.6k • 542