daVinci-Dev: Agent-native Mid-training for Software Engineering • Paper • 2601.18418 • Published 19 days ago • 124
LiveTalk: Real-Time Multimodal Interactive Video Diffusion via Improved On-Policy Distillation • Paper • 2512.23576 • Published Dec 29, 2025 • 65
What Generative Search Engines Like and How to Optimize Web Content Cooperatively • Paper • 2510.11438 • Published Oct 13, 2025 • 11
RefineX: Learning to Refine Pre-training Data at Scale from Expert-Guided Programs • Paper • 2507.03253 • Published Jul 4, 2025 • 19
OctoThinker: Mid-training Incentivizes Reinforcement Learning Scaling • Paper • 2506.20512 • Published Jun 25, 2025 • 47
DeepResearchGym: A Free, Transparent, and Reproducible Evaluation Sandbox for Deep Research • Paper • 2505.19253 • Published May 25, 2025 • 32