Nemotron-TwoTower Collection Diffusion Language Modeling with Pretrained Autoregressive Nemotron 3 Models • 1 item • Updated 2 days ago • 4
Ornith-1.0 Collection Ornith-1.0 is a family of open-source LLMs specialized for agentic coding. • 8 items • Updated about 9 hours ago • 167
VibeThinker-3B: Exploring the Frontier of Verifiable Reasoning in Small Language Models Paper • 2606.16140 • Published 12 days ago • 119
FlashMemory-DeepSeek-V4: Lightning Index Ultra-Long Context via Lookahead Sparse Attention Paper • 2606.09079 • Published 19 days ago • 64
Gemma 4 QAT Collection Gemma 4 QAT (Quantization-Aware Training) for 3x less memory use and near original accuracy. • 16 items • Updated 12 days ago • 95
Domino Collection Domino: Decoupling Causal Modeling from Autoregressive Drafting in Speculative Decoding • 3 items • Updated 5 days ago • 3
Qwen 3.x MTP Collection MLX MTP drafter checkpoints for Qwen 3.x speculative decoding with mlx-vlm. • 12 items • Updated 26 days ago • 9
REAP the Experts: Why Pruning Prevails for One-Shot MoE compression Paper • 2510.13999 • Published Oct 15, 2025 • 20
Efficient Agentic Reasoning Through Self-Regulated Simulative Planning Paper • 2605.22138 • Published May 21 • 11
Achieving Gold-Medal-Level Olympiad Reasoning via Simple and Unified Scaling Paper • 2605.13301 • Published May 13 • 165
SpecDrift Collection Models released as a part of Attention-Drift Paper, trained for deployment on production • 2 items • Updated May 10 • 2
view article Article Introducing NVIDIA Nemotron 3 Nano Omni: Long-Context Multimodal Intelligence for Documents, Audio and Video Agents nvidia • Apr 28 • 62