Open-Models - a th3nolo Collection

openai/gpt-oss-120b

Text Generation • 120B • Updated Aug 26, 2025 • 4.05M • • 4.91k

Emergent temporal abstractions in autoregressive models enable hierarchical reinforcement learning

Paper • 2512.20605 • Published Dec 23, 2025 • 62

GRAN-TED: Generating Robust, Aligned, and Nuanced Text Embedding for Diffusion Models

Paper • 2512.15560 • Published Dec 17, 2025 • 25

Dream-VL & Dream-VLA: Open Vision-Language and Vision-Language-Action Models with Diffusion Language Model Backbone

Paper • 2512.22615 • Published Dec 27, 2025 • 51

tencent/HY-Motion-1.0

Text-to-3D • Updated Dec 31, 2025 • 482 • 417

Lightricks/LTX-2

Image-to-Video • Updated Mar 2 • 557k • • 1.75k

LightOnOCR: A 1B End-to-End Multilingual Vision-Language Model for State-of-the-Art OCR

Paper • 2601.14251 • Published Jan 20 • 29

DynamicVLA: A Vision-Language-Action Model for Dynamic Object Manipulation

Paper • 2601.22153 • Published Jan 29 • 75

tencent/Youtu-VL-4B-Instruct

Image-Text-to-Text • 5B • Updated Feb 10 • 680 • 157

Generation Enhances Understanding in Unified Multimodal Models via Multi-Representation Generation

Paper • 2601.21406 • Published Jan 29 • 6

Reinforcement Learning via Self-Distillation

Paper • 2601.20802 • Published Jan 28 • 50

DeepSeek-OCR 2: Visual Causal Flow

Paper • 2601.20552 • Published Jan 28 • 71

zai-org/GLM-OCR

Image-Text-to-Text • 1B • Updated May 19 • 3.18M • • 1.86k

unsloth/Qwen3-Coder-Next-FP8-Dynamic

Text Generation • 80B • Updated Feb 3 • 15.8k • 43

Qwen/Qwen3-Coder-Next

Text Generation • 80B • Updated Feb 3 • 1.17M • • 1.48k

GigaBrain-0.5M*: a VLA That Learns From World Model-Based Reinforcement Learning

Paper • 2602.12099 • Published Feb 12 • 62

Lightricks/LTX-2.3

Image-to-Video • Updated Apr 13 • 1.81M • 1.45k

mistralai/Leanstral-2603

Updated Apr 21 • 127 • 163

Qianfan-OCR: A Unified End-to-End Model for Document Intelligence

Paper • 2603.13398 • Published Mar 11 • 155

Jackrong/Qwen3.5-4B-Claude-4.6-Opus-Reasoning-Distilled-v2-GGUF

Image-Text-to-Text • 4B • Updated Apr 6 • 16.3k • 134

OpenResearcher: A Fully Open Pipeline for Long-Horizon Deep Research Trajectory Synthesis

Paper • 2603.20278 • Published Mar 17 • 101

YTan2000/Qwen3.5-27B-TQ3_1S

Image-Text-to-Text • 27B • Updated Apr 23 • 140 • 38

bartowski/arcee-ai_Trinity-Large-Thinking-GGUF

Text Generation • 399B • Updated Apr 1 • 1.79k • 12

zed-industries/zeta-2

Text Generation • 8B • Updated Mar 23 • 558 • 183

mudler/Qwen3.5-35B-A3B-APEX-GGUF

Text Generation • 35B • Updated Apr 27 • 19.7k • 91

Jackrong/Qwopus3.5-27B-v3

Image-Text-to-Text • 27B • Updated Apr 16 • 810 • 249

Think, Act, Build: An Agentic Framework with Vision Language Models for Zero-Shot 3D Visual Grounding

Paper • 2604.00528 • Published Apr 1 • 12

0xSero/Gemma-4-21B

Text Generation • 21B • Updated 26 days ago • 368 • 98

datalab-to/chandra-ocr-2

Image-Text-to-Text • 5B • Updated Mar 18 • 1.81M • 412

lightonai/LightOnOCR-2-1B

Image-Text-to-Text • 1B • Updated May 4 • 230k • 703

selimaktas/MiniMax-M2.75-460B-A20B

Text Generation • 453B • Updated Apr 21 • 33 • 26

Qwen/Qwen3.6-27B

Image-Text-to-Text • 28B • Updated Apr 24 • 5.69M • • 1.8k

XiaomiMiMo/MiMo-V2-Flash

Text Generation • 310B • Updated Apr 20 • 69.9k • • 741

openai/privacy-filter

Token Classification • 1B • Updated Apr 22 • 302k • • 1.67k

concavity-ai/superlinear-exp-v0.1

Text Generation • 32B • Updated Feb 6 • 17 • 22

openbmb/InfLLM-V2-Long-Sparse-Base

8B • Updated Dec 1, 2025 • 35 • 7

deepseek-ai/DeepSeek-V3.2-Exp

Text Generation • 685B • Updated Nov 18, 2025 • 209k • • 989

HISA: Efficient Hierarchical Indexing for Fine-Grained Sparse Attention

Paper • 2603.28458 • Published Mar 30 • 44

oongaboongahacker/Gemini-Nano

Updated Jun 25, 2024 • 42

Epicure: Navigating the Emergent Geometry of Food Ingredient Embeddings

Paper • 2605.22391 • Published May 21 • 38

Kaikaku/epicure-cooc

Feature Extraction • Updated 29 days ago • 2.96k • 34

unsloth/gemma-4-12b-it-GGUF

Image-Text-to-Text • 12B • Updated 16 days ago • 1.32M • 695

nex-agi/Nex-N2-Pro

Text Generation • 397B • Updated 14 days ago • 8.12k • 350

MiniMaxAI/MiniMax-M3

Image-Text-to-Text • 427B • Updated 2 days ago • 154k • • 1.23k

DiffusionGemma vs Gemma-4 — Post-OCR Correction

📰

19

Diffusion vs autoregressive LLM on historical OCR cleanup

CohereLabs/North-Mini-Code-1.0

Text Generation • 30B • Updated 10 days ago • 26k • 488

microsoft/FastContext-1.0-4B-SFT

Text Generation • 4B • Updated 8 days ago • 5.28k • • 339

SupraLabs/Supra-1.5-50M-Instruct-exp

Text Generation • 51.8M • Updated 13 days ago • 1.4k • 46

zai-org/GLM-5.2

Text Generation • 753B • Updated 2 days ago • 67.1k • • 2.41k

SupraLabs/Supra-A2A-Nano-Exp

Any-to-Any • 29.7M • Updated 4 days ago • 26

baidu/Unlimited-OCR

Image-Text-to-Text • 3B • Updated about 23 hours ago • 70.7k • 816

docling-project/DocLayNet

Updated Jan 25, 2023 • 674 • 140