ISO-Bench: Can Coding Agents Optimize Real-World Inference Workloads? Paper • 2602.19594 • Published Feb 23 • 3
Structured Distillation of Web Agent Capabilities Enables Generalization Paper • 2604.07776 • Published about 1 month ago • 22
Locate, Steer, and Improve: A Practical Survey of Actionable Mechanistic Interpretability in Large Language Models Paper • 2601.14004 • Published Jan 20 • 48
💧 LFM2.5 Collection Collection of post-trained and base LFM2.5 models. • 30 items • Updated Apr 8 • 135
Nemotron 3 Nano Omni: Efficient and Open Multimodal Intelligence Paper • 2604.24954 • Published 13 days ago • 21
Crystalite: A Lightweight Transformer for Efficient Crystal Modeling Paper • 2604.02270 • Published Apr 2 • 1
Trace2Skill: Distill Trajectory-Local Lessons into Transferable Agent Skills Paper • 2603.25158 • Published Mar 26 • 52
view article Article SynthVision: Building a 110K Synthetic Medical VQA Dataset with Cross-Model Validation Mar 23 • 17
Self-Improving Pretraining: using post-trained models to pretrain better models Paper • 2601.21343 • Published Jan 29 • 19
view article Article Keep the Tokens Flowing: Lessons from 16 Open-Source RL Libraries +7 Mar 10 • 145
Teaching Models to Teach Themselves: Reasoning at the Edge of Learnability Paper • 2601.18778 • Published Jan 26 • 42
Aya Datasets Collection The Aya Collection is a massive multilingual collection for over 100 languages consisting of 513 million instances of prompts and completions. • 5 items • Updated Jul 31, 2025 • 29
Inference Optimized Checkpoints (with Model Optimizer) Collection A collection of generative models quantized and optimized for inference with Model Optimizer. • 65 items • Updated about 15 hours ago • 153