From Context to Skills: Can Language Models Learn from Context Skillfully? Paper • 2604.27660 • Published 5 days ago • 141
nvidia/Nemotron-3-Nano-Omni-30B-A3B-Reasoning-NVFP4 Any-to-Any • 18B • Updated 2 days ago • 462k • 92
EditCrafter: Tuning-free High-Resolution Image Editing via Pretrained Diffusion Model Paper • 2604.10268 • Published 27 days ago • 12
view article Article Introducing NVIDIA Nemotron 3 Nano Omni: Long-Context Multimodal Intelligence for Documents, Audio and Video Agents 9 days ago • 44
World-R1: Reinforcing 3D Constraints for Text-to-Video Generation Paper • 2604.24764 • Published 11 days ago • 116
view article Article Training and Finetuning Multimodal Embedding & Reranker Models with Sentence Transformers 22 days ago • 69
FP4 Explore, BF16 Train: Diffusion Reinforcement Learning via Efficient Rollout Scaling Paper • 2604.06916 • Published 30 days ago • 34
Running Featured 77 Distilling 100B+ Models 40x Faster with TRL 📝 77 TRL distillation for 100B+ teachers, 40x faster
Omni-SimpleMem: Autoresearch-Guided Discovery of Lifelong Multimodal Agent Memory Paper • 2604.01007 • Published Apr 2 • 31
Intern-S1-Pro: Scientific Multimodal Foundation Model at Trillion Scale Paper • 2603.25040 • Published Mar 26 • 132