PyVision-RL: Forging Open Agentic Vision Models via RL • Paper • 2602.20739 • Published 18 days ago • 29
BitDance: Scaling Autoregressive Generative Models with Binary Tokens • Paper • 2602.14041 • Published 27 days ago • 52
Mobile-Agent-v3.5: Multi-platform Fundamental GUI Agents • Paper • 2602.16855 • Published 27 days ago • 48
Generated Reality: Human-centric World Simulation using Interactive Video Generation with Hand and Camera Control • Paper • 2602.18422 • Published 21 days ago • 30
PaperBanana: Automating Academic Illustration for AI Scientists • Paper • 2601.23265 • Published Jan 30 • 217
Transition Matching Distillation for Fast Video Generation • Paper • 2601.09881 • Published Jan 14 • 33
Thinking with Map: Reinforced Parallel Map-Augmented Agent for Geolocalization • Paper • 2601.05432 • Published Jan 8 • 169
VideoAR: Autoregressive Video Generation via Next-Frame & Scale Prediction • Paper • 2601.05966 • Published Jan 9 • 23