Nemotron RAG Collection Set of tools to build retrieval-augmented generation (RAG) systems, improve search and ranking accuracy, and extract structured data from complex docs • 9 items • Updated 3 days ago • 80
Article Build an Agent That Thinks Like a Data Scientist: How We Hit #1 on DABStep with Reusable Tool Generation 2 days ago • 9
SkillNet: Create, Evaluate, and Connect AI Skills Paper • 2603.04448 • Published 16 days ago • 79
Transformers.js V4 demos Collection A collection of demos built with Transformers.js V4 • 15 items • Updated 1 day ago • 15
Does Your Reasoning Model Implicitly Know When to Stop Thinking? Paper • 2602.08354 • Published Feb 9 • 261
MARS: Modular Agent with Reflective Search for Automated AI Research Paper • 2602.02660 • Published Feb 2 • 65
Endless Terminals: Scaling RL Environments for Terminal Agents Paper • 2601.16443 • Published Jan 23 • 18
Rewarding the Rare: Uniqueness-Aware RL for Creative Problem Solving in LLMs Paper • 2601.08763 • Published Jan 13 • 148
Collaborative Multi-Agent Test-Time Reinforcement Learning for Reasoning Paper • 2601.09667 • Published Jan 14 • 91
Multiplex Thinking: Reasoning via Token-wise Branch-and-Merge Paper • 2601.08808 • Published Jan 13 • 39
Article Building the Open Agent Ecosystem Together: Introducing OpenEnv +8 Oct 23, 2025 • 150
SmolLM2 Collection State-of-the-art compact LLMs for on-device applications: 1.7B, 360M, 135M • 16 items • Updated May 5, 2025 • 305
MMFormalizer: Multimodal Autoformalization in the Wild Paper • 2601.03017 • Published Jan 6 • 105
AgentFly: Fine-tuning LLM Agents without Fine-tuning LLMs Paper • 2508.16153 • Published Aug 22, 2025 • 160
Agent Lightning: Train ANY AI Agents with Reinforcement Learning Paper • 2508.03680 • Published Aug 5, 2025 • 137
LongVie: Multimodal-Guided Controllable Ultra-Long Video Generation Paper • 2508.03694 • Published Aug 5, 2025 • 52