NVIDIA Nemotron v3 Collection Open, Production-ready Enterprise Models β’ 15 items β’ Updated 5 days ago β’ 247
view changelog Hugging Face Changelog Introducing Buckets: S3-like storage on the Hub 19 days ago β’ 182
RDT 2 Collection RDT 2, the sequel to RDT-1B, is the first foundation model that achieves zero-shot deployment on unseen embodiments for simple open-vocabulary tasks. β’ 4 items β’ Updated Sep 26, 2025 β’ 17
Agent Lightning: Train ANY AI Agents with Reinforcement Learning Paper β’ 2508.03680 β’ Published Aug 5, 2025 β’ 138
FineWeb2: One Pipeline to Scale Them All -- Adapting Pre-Training Data Processing to Every Language Paper β’ 2506.20920 β’ Published Jun 26, 2025 β’ 78
LiveCC: Learning Video LLM with Streaming Speech Transcription at Scale Paper β’ 2504.16030 β’ Published Apr 22, 2025 β’ 36
Advances and Challenges in Foundation Agents: From Brain-Inspired Intelligence to Evolutionary, Collaborative, and Safe Systems Paper β’ 2504.01990 β’ Published Mar 31, 2025 β’ 305
Marco-o1: Towards Open Reasoning Models for Open-Ended Solutions Paper β’ 2411.14405 β’ Published Nov 21, 2024 β’ 61
SmolLM2 Collection State-of-the-art compact LLMs for on-device applications: 1.7B, 360M, 135M β’ 16 items β’ Updated May 5, 2025 β’ 305
Aria: An Open Multimodal Native Mixture-of-Experts Model Paper β’ 2410.05993 β’ Published Oct 8, 2024 β’ 111
DoLa: Decoding by Contrasting Layers Improves Factuality in Large Language Models Paper β’ 2309.03883 β’ Published Sep 7, 2023 β’ 36
SDXL: Improving Latent Diffusion Models for High-Resolution Image Synthesis Paper β’ 2307.01952 β’ Published Jul 4, 2023 β’ 91