Sub-2b T2T LLMs - Finetunes - a d0zz0d Collection

d0zz0d 's Collections

Below Double-Digits - T2T LLMs

Below Double-Digits - IT2T VLMs

Sub-5b Parameter T2T LLMs

Sub-5b Parameter MM2T LLMs

Sub-5b T2T LLMs - Finetunes

Sub-2b Parameter T2T LLMs

Sub-2b Parameter IT2T VLMs

Sub-2b T2T LLMs - Finetunes

Sub-2b T2T LLMs - Historical

Sub-1b Parameter T2T LLMs

Sub-1b Parameter IT2T VLMs

Sub-1b T2T LLMs - Finetunes

Sub-1b T2T LLMs - Historical

Sub-2b T2T LLMs - Finetunes

updated Nov 29, 2025

A list of Finetuned LLMs, improving some or all aspects of a base LLM

SicariusSicariiStuff/Nano_Imp_1B

1B • Updated Nov 8, 2025 • 24 • 22

Note the smallest usable role-play enhanced language model.
nvidia/Nemotron-Research-Reasoning-Qwen-1.5B

Text Generation • 2B • Updated Nov 21, 2025 • 1.88k • 234

Note a tiny reasoning model, from a trillion dollar company, who refuses to make GPUs cheaper...
nvidia/OpenReasoning-Nemotron-1.5B

Text Generation • 2B • Updated Sep 16, 2025 • 258 • 51

Note another tiny reasoning model, from a trillion dollar company, who refuses to make GPUs cheaper...
agentica-org/DeepScaleR-1.5B-Preview

Text Generation • 2B • Updated Apr 9, 2025 • 12.9k • 577

Note an improved version of the DeepSeek r1's tiniest distill
agentica-org/DeepCoder-1.5B-Preview

Text Generation • 2B • Updated Apr 9, 2025 • 134 • 71

Note an improved version of the DeepSeek r1's tiniest distill, with coding capabilities enhanced
Menlo/Lucy

Text Generation • 2B • Updated Aug 4, 2025 • 34 • 64

Note a finetune to enhance web search
Menlo/Lucy-128k

Text Generation • 2B • Updated Aug 4, 2025 • 219 • 108

Note a finetune to enhance web search, now with 128k tokens of context!
dnotitia/Smoothie-Qwen3-1.7B

Text Generation • 2B • Updated May 4, 2025 • 103 • 2

Note a finetune to smooth out token probability
DavidAU/Llama-3.2-1B-Instruct-NEO-SI-FI-GGUF

Text Generation • 1B • Updated Jul 28, 2025 • 549 • 10

Note a creative writing enhanced finetune, uncensored too!
huihui-ai/Huihui-MoE-1.5B-A0.6B-abliterated

Text Generation • 2B • Updated Jun 14, 2025 • 26

Note a tiny MoE model, built with a bunch of qwen3-0.6b experts. uncensored.
DavidAU/Gemma-3-1b-it-MAX-NEO-Imatrix-GGUF

Text Generation • 1.0B • Updated Jul 28, 2025 • 692 • 5

Note a creative writing enhanced finetune, uncensored too!
Goekdeniz-Guelmez/Josiefied-Qwen3-1.7B-abliterated-v1

Text Generation • 2B • Updated Aug 11, 2025 • 1.09k • 6

Note a better abliteration of qwen3-1.7b
janhq/Jan-v1-edge

Text Generation • 2B • Updated Sep 4, 2025 • 58 • 39

Note a finetune to enhance web search
POLARIS-Project/Polaris-1.7B-Preview

2B • Updated Jul 10, 2025 • 42 • 8

Note a finetune that greatly enhances math capabilities
WeiboAI/VibeThinker-1.5B

Text Generation • 2B • Updated Nov 24, 2025 • 2.38k • 508

Note a VERY powerful (and token consuming) Language model!
nvidia/DLER-R1-1.5B-Research

2B • Updated Oct 25, 2025 • 10.7k • 17
nvidia/AceMath-1.5B-Instruct

Text Generation • 2B • Updated Jan 17, 2025 • 1.2k • 15