DeepSeek testing - a gaunernst Collection

updated Apr 10, 2025

A collection of MoE+MLA models, serving as testing proxies for DeepSeek-V3/R1

deepseek-ai/DeepSeek-V2-Lite-Chat
Text Generation • 16B • Updated Jun 25, 2024 • 1.06M • 141

gaunernst/DeepSeek-V2-Lite-Chat-FP8
16B • Updated Apr 7, 2025 • 18.4k

TechxGenus/DeepSeek-V2-Lite-Chat-AWQ
Text Generation • 16B • Updated Jul 4, 2024 • 39 • 3

deepseek-ai/DeepSeek-R1
Text Generation • 685B • Updated Mar 27, 2025 • 7.2M • 13.4k

meituan/DeepSeek-R1-Block-INT8
Text Generation • Updated Feb 27, 2025 • 1.95k • 50

meituan/DeepSeek-R1-Channel-INT8
Text Generation • 685B • Updated May 8 • 17 • 33

QuixiAI/DeepSeek-V3-AWQ
Text Generation • 671B • Updated Mar 29, 2025 • 414 • 35

ISTA-DASLab/DeepSeek-R1-GPTQ-4b-128g-experts
Text Generation • 676B • Updated Apr 8, 2025 • 13 • 4