Learning from the Self-future: On-policy Self-distillation for dLLMs Paper • 2606.18195 • Published 17 days ago • 76
Skill-3D: Evolving Scene-Aware Skills for Agentic 3D Spatial Reasoning Paper • 2606.07436 • Published 27 days ago • 25
Flow-DPPO: Divergence Proximal Policy Optimization for Flow Matching Models Paper • 2606.11025 • Published 24 days ago • 41
Language-based Trial and Error Falls Behind in the Era of Experience Paper • 2601.21754 • Published Jan 29 • 16
VTC-R1: Vision-Text Compression for Efficient Long-Context Reasoning Paper • 2601.22069 • Published Jan 29 • 7
DeepResearchEval: An Automated Framework for Deep Research Task Construction and Agentic Evaluation Paper • 2601.09688 • Published Jan 14 • 128
DAComp: Benchmarking Data Agents across the Full Data Intelligence Lifecycle Paper • 2512.04324 • Published Dec 3, 2025 • 160
Running on Zero Agents Featured 478 DeepSeek OCR 2 Demo 🚀 478 Try out DeepSeek-OCR-2 on your PDFs or images