gaunernst's Collections
DeepSeek testing
Updated Apr 10, 2025
A collection of MoE+MLA models, serving as testing proxies for DeepSeek-V3/R1
deepseek-ai/DeepSeek-V2-Lite-Chat • Text Generation • 16B • Updated Jun 25, 2024 • 387k • 135
gaunernst/DeepSeek-V2-Lite-Chat-FP8 • 16B • Updated Apr 7, 2025 • 5.75k
TechxGenus/DeepSeek-V2-Lite-Chat-AWQ • Text Generation • 16B • Updated Jul 4, 2024 • 124 • 2
deepseek-ai/DeepSeek-R1 • Text Generation • 685B • Updated Mar 27, 2025 • 1.25M • 13.1k
meituan/DeepSeek-R1-Block-INT8 • Text Generation • Updated Feb 27, 2025 • 794 • 49
meituan/DeepSeek-R1-Channel-INT8 • Text Generation • 685B • Updated Feb 27, 2025 • 1.19k • 32
QuixiAI/DeepSeek-V3-AWQ • Text Generation • Updated Mar 29, 2025 • 1.2k • 35
ISTA-DASLab/DeepSeek-R1-GPTQ-4b-128g-experts • Text Generation • 104B • Updated Apr 8, 2025 • 6 • 4