DeepSeek
company
Verified
AI & ML interests
None defined yet.
Recent Activity
View all activity
Papers
DualPath: Breaking the Storage Bandwidth Bottleneck in Agentic LLM Inference
DeepSeek-OCR 2: Visual Causal Flow
-
deepseek-ai/DeepSeek-V3.2-Exp
Text Generation • 685B • Updated • 209k • • 989 -
deepseek-ai/DeepSeek-V3.2-Exp-Base
Text Generation • 685B • Updated • 323 • 67 -
deepseek-ai/DeepSeek-V3.2
Text Generation • 685B • Updated • 2.89M • • 1.45k -
deepseek-ai/DeepSeek-V3.2-Speciale
Text Generation • 685B • Updated • 1.94k • • 710
-
deepseek-ai/DeepSeek-R1
Text Generation • 685B • Updated • 7.14M • • 13.4k -
deepseek-ai/DeepSeek-R1-Zero
Text Generation • 685B • Updated • 5.58k • 958 -
deepseek-ai/DeepSeek-R1-Distill-Llama-70B
Text Generation • 71B • Updated • 260k • • 780 -
deepseek-ai/DeepSeek-R1-Distill-Qwen-32B
Text Generation • 33B • Updated • 776k • • 1.57k
DeepSeek Math series
-
deepseek-ai/DeepSeek-Math-V2
Text Generation • 685B • Updated • 679 • 699 -
deepseek-ai/deepseek-math-7b-instruct
Text Generation • Updated • 4.18k • 152 -
deepseek-ai/deepseek-math-7b-rl
Text Generation • 7B • Updated • 2.56k • 97 -
deepseek-ai/deepseek-math-7b-base
Text Generation • Updated • 3.33k • 89
Janus is a novel autoregressive framework that unifies multimodal understanding and generation.
-
deepseek-ai/DeepSeek-V2-Chat-0628
Text Generation • 236B • Updated • 3.51k • 178 -
deepseek-ai/DeepSeek-V2-Chat
Text Generation • 236B • Updated • 13.9k • 462 -
deepseek-ai/DeepSeek-V2
Text Generation • 236B • Updated • 5.51k • 334 -
deepseek-ai/DeepSeek-V2-Lite
Text Generation • 16B • Updated • 420k • 180
models for paper expert-specialized fine-tuning
DeepSeek Coder series
-
deepseek-ai/deepseek-coder-33b-instruct
Text Generation • 33B • Updated • 6.64k • 577 -
deepseek-ai/deepseek-coder-6.7b-instruct
Text Generation • 7B • Updated • 325k • 500 -
deepseek-ai/deepseek-coder-7b-instruct-v1.5
Text Generation • 7B • Updated • 591k • 156 -
deepseek-ai/deepseek-coder-1.3b-instruct
Text Generation • 1B • Updated • 49.9k • 169
-
Chat with DeepSeek-VL2-small
🌍608Chat with an AI using text and images for visual answers
-
deepseek-ai/deepseek-vl2-tiny
Image-Text-to-Text • 3B • Updated • 823k • 248 -
deepseek-ai/deepseek-vl2-small
Image-Text-to-Text • 16B • Updated • 6.05k • 179 -
deepseek-ai/deepseek-vl2
Image-Text-to-Text • 27B • Updated • 4.52k • 387
DeepSeek-Prover-Series
-
deepseek-ai/DeepSeek-Coder-V2-Instruct
Text Generation • 236B • Updated • 4.81k • 689 -
deepseek-ai/DeepSeek-Coder-V2-Base
Text Generation • 236B • Updated • 1.42k • 81 -
deepseek-ai/DeepSeek-Coder-V2-Lite-Base
Text Generation • 16B • Updated • 3.15k • 113 -
deepseek-ai/DeepSeek-Coder-V2-Lite-Instruct
Text Generation • 16B • Updated • 1.17M • • 614
DeepSeek-VL model series
DeepSeek LLM series
DeepSeek MoE series
-
deepseek-ai/DeepSeek-V3.2-Exp
Text Generation • 685B • Updated • 209k • • 989 -
deepseek-ai/DeepSeek-V3.2-Exp-Base
Text Generation • 685B • Updated • 323 • 67 -
deepseek-ai/DeepSeek-V3.2
Text Generation • 685B • Updated • 2.89M • • 1.45k -
deepseek-ai/DeepSeek-V3.2-Speciale
Text Generation • 685B • Updated • 1.94k • • 710
-
deepseek-ai/DeepSeek-R1
Text Generation • 685B • Updated • 7.14M • • 13.4k -
deepseek-ai/DeepSeek-R1-Zero
Text Generation • 685B • Updated • 5.58k • 958 -
deepseek-ai/DeepSeek-R1-Distill-Llama-70B
Text Generation • 71B • Updated • 260k • • 780 -
deepseek-ai/DeepSeek-R1-Distill-Qwen-32B
Text Generation • 33B • Updated • 776k • • 1.57k
DeepSeek Math series
-
deepseek-ai/DeepSeek-Math-V2
Text Generation • 685B • Updated • 679 • 699 -
deepseek-ai/deepseek-math-7b-instruct
Text Generation • Updated • 4.18k • 152 -
deepseek-ai/deepseek-math-7b-rl
Text Generation • 7B • Updated • 2.56k • 97 -
deepseek-ai/deepseek-math-7b-base
Text Generation • Updated • 3.33k • 89
-
Chat with DeepSeek-VL2-small
🌍608Chat with an AI using text and images for visual answers
-
deepseek-ai/deepseek-vl2-tiny
Image-Text-to-Text • 3B • Updated • 823k • 248 -
deepseek-ai/deepseek-vl2-small
Image-Text-to-Text • 16B • Updated • 6.05k • 179 -
deepseek-ai/deepseek-vl2
Image-Text-to-Text • 27B • Updated • 4.52k • 387
Janus is a novel autoregressive framework that unifies multimodal understanding and generation.
DeepSeek-Prover-Series
-
deepseek-ai/DeepSeek-V2-Chat-0628
Text Generation • 236B • Updated • 3.51k • 178 -
deepseek-ai/DeepSeek-V2-Chat
Text Generation • 236B • Updated • 13.9k • 462 -
deepseek-ai/DeepSeek-V2
Text Generation • 236B • Updated • 5.51k • 334 -
deepseek-ai/DeepSeek-V2-Lite
Text Generation • 16B • Updated • 420k • 180
-
deepseek-ai/DeepSeek-Coder-V2-Instruct
Text Generation • 236B • Updated • 4.81k • 689 -
deepseek-ai/DeepSeek-Coder-V2-Base
Text Generation • 236B • Updated • 1.42k • 81 -
deepseek-ai/DeepSeek-Coder-V2-Lite-Base
Text Generation • 16B • Updated • 3.15k • 113 -
deepseek-ai/DeepSeek-Coder-V2-Lite-Instruct
Text Generation • 16B • Updated • 1.17M • • 614
models for paper expert-specialized fine-tuning
DeepSeek-VL model series
DeepSeek Coder series
-
deepseek-ai/deepseek-coder-33b-instruct
Text Generation • 33B • Updated • 6.64k • 577 -
deepseek-ai/deepseek-coder-6.7b-instruct
Text Generation • 7B • Updated • 325k • 500 -
deepseek-ai/deepseek-coder-7b-instruct-v1.5
Text Generation • 7B • Updated • 591k • 156 -
deepseek-ai/deepseek-coder-1.3b-instruct
Text Generation • 1B • Updated • 49.9k • 169
DeepSeek LLM series
DeepSeek MoE series