Article Nemotron 3 Nano - A new Standard for Efficient, Open, and Intelligent Agentic Models nvidia • Dec 15, 2025 • 113
Article Nemotron 3 Nano 4B: A Compact Hybrid Model for Efficient Local AI nvidia • Mar 17 • 67
Gemma 4 — DECKARD HERETIC, Multimodal & Speculators Collection Gemma 4 abliterated/quantized — DECKARD HERETIC 31B, SuperGemma4-26B multimodal, 26B-A4B MoE, plus EAGLE3/DFlash drafters. • 14 items • Updated 2 days ago • 8
HEBATRON: A Hebrew-Specialized Open-Weight Mixture-of-Experts Language Model Paper • 2605.11255 • Published May 11 • 1
Learn from your own latents and not from tokens: A sample-complexity theory Paper • 2605.27734 • Published May 26 • 2
TextCraftor: Your Text Encoder Can be Image Quality Controller Paper • 2403.18978 • Published Mar 27, 2024 • 15
Generating Coherent Sequences of Visual Illustrations for Real-World Manual Tasks Paper • 2405.10122 • Published May 16, 2024 • 1
Making Multimodal Generation Easier: When Diffusion Models Meet LLMs Paper • 2310.08949 • Published Oct 13, 2023 • 2
Programmable-Room: Interactive Textured 3D Room Meshes Generation Empowered by Large Language Models Paper • 2506.17707 • Published Jun 21, 2025 • 1
Reason out Your Layout: Evoking the Layout Master from Large Language Models for Text-to-Image Synthesis Paper • 2311.17126 • Published Nov 28, 2023 • 2
LaViDa: A Large Diffusion Language Model for Multimodal Understanding Paper • 2505.16839 • Published May 22, 2025 • 14
SUR-adapter: Enhancing Text-to-Image Pre-trained Diffusion Models with Large Language Models Paper • 2305.05189 • Published May 9, 2023 • 4
dMLLM-TTS: Self-Verified and Efficient Test-Time Scaling for Diffusion Multi-Modal Large Language Models Paper • 2512.19433 • Published Dec 22, 2025 • 4
DIFFA: Large Language Diffusion Models Can Listen and Understand Paper • 2507.18452 • Published Jul 24, 2025 • 2