Manifold-Aware Exploration for Reinforcement Learning in Video Generation Paper β’ 2603.21872 β’ Published 7 days ago β’ 33
Infinite-World: Scaling Interactive World Models to 1000-Frame Horizons via Pose-Free Hierarchical Memory Paper β’ 2602.02393 β’ Published Feb 2 β’ 17
iFSQ: Improving FSQ for Image Generation with 1 Line of Code Paper β’ 2601.17124 β’ Published Jan 23 β’ 33
PhysBrain: Human Egocentric Data as a Bridge from Vision Language Models to Physical Intelligence Paper β’ 2512.16793 β’ Published Dec 18, 2025 β’ 76
Depth Any Panoramas: A Foundation Model for Panoramic Depth Estimation Paper β’ 2512.16913 β’ Published Dec 18, 2025 β’ 34
IR3D-Bench: Evaluating Vision-Language Model Scene Understanding as Agentic Inverse Rendering Paper β’ 2506.23329 β’ Published Jun 29, 2025 β’ 8
Diff4Splat: Controllable 4D Scene Generation with Latent Dynamic Reconstruction Models Paper β’ 2511.00503 β’ Published Nov 1, 2025 β’ 2
DynamicVerse: A Physically-Aware Multimodal Framework for 4D World Modeling Paper β’ 2512.03000 β’ Published Dec 2, 2025 β’ 37
JarvisEvo: Towards a Self-Evolving Photo Editing Agent with Synergistic Editor-Evaluator Optimization Paper β’ 2511.23002 β’ Published Nov 28, 2025 β’ 26
JarvisEvo: Towards a Self-Evolving Photo Editing Agent with Synergistic Editor-Evaluator Optimization Paper β’ 2511.23002 β’ Published Nov 28, 2025 β’ 26
JarvisEvo: Towards a Self-Evolving Photo Editing Agent with Synergistic Editor-Evaluator Optimization Paper β’ 2511.23002 β’ Published Nov 28, 2025 β’ 26 β’ 2
JarvisEvo: Towards a Self-Evolving Photo Editing Agent with Synergistic Editor-Evaluator Optimization Paper β’ 2511.23002 β’ Published Nov 28, 2025 β’ 26 β’ 2
Runtime error Featured 104 JarvisArt Preview π 104 Generate Lightroom presets using AI and image input