view article Article Introducing Waypoint-1: Real-time interactive video diffusion from Overworld +3 7 days ago • 21
view article Article Introducing OptiMind, a research model designed for optimization 11 days ago • 31
view article Article Building Autonomous Vehicles That Reason with the NVIDIA Alpamayo Open Ecosystem 21 days ago • 17
view article Article Tokenization in Transformers v5: Simpler, Clearer, and More Modular +4 Dec 18, 2025 • 116
VL-JEPA: Joint Embedding Predictive Architecture for Vision-language Paper • 2512.10942 • Published Dec 11, 2025 • 49
view article Article Red Teaming with RL: Exploiting Tinker API for Harmful RL on 235B Model 25 days ago • 17
SpecBundle Collection A collection of production-grade draft models for speculative decoding • 15 items • Updated 14 days ago • 15
LongVie 2: Multimodal Controllable Ultra-Long Video World Model Paper • 2512.13604 • Published Dec 15, 2025 • 74
Depth Any Panoramas: A Foundation Model for Panoramic Depth Estimation Paper • 2512.16913 • Published Dec 18, 2025 • 34
Olmo 3.1 Collection The latest members of the Olmo 3 family: another 3 weeks of RL for 32B Think, the 32B Instruct model, large post-training research datasets... • 9 items • Updated Dec 23, 2025 • 45
Confucius Code Agent: An Open-sourced AI Software Engineer at Industrial Scale Paper • 2512.10398 • Published Dec 11, 2025 • 11
Chain-of-Visual-Thought: Teaching VLMs to See and Think Better with Continuous Visual Tokens Paper • 2511.19418 • Published Nov 24, 2025 • 29
view article Article Tensor Parallelism (TP) in Transformers: 5 Minutes to Understand Dec 4, 2025 • 63