LingBot-VLA Collection Vision-Language-Action Foundation Model • 2 items • Updated 10 minutes ago • 1
The World is Your Canvas: Painting Promptable Events with Reference Images, Trajectories, and Text Paper • 2512.16924 • Published Dec 18, 2025 • 27
Reward Forcing: Efficient Streaming Video Generation with Rewarded Distribution Matching Distillation Paper • 2512.04678 • Published Dec 4, 2025 • 41
MagicQuillV2: Precise and Interactive Image Editing with Layered Visual Cues Paper • 2512.03046 • Published Dec 2, 2025 • 12
HoloCine: Holistic Generation of Cinematic Multi-Shot Long Video Narratives Paper • 2510.20822 • Published Oct 23, 2025 • 41
CoDeF: Content Deformation Fields for Temporally Consistent Video Processing Paper • 2308.07926 • Published Aug 15, 2023 • 28
Scaling Instruction-Based Video Editing with a High-Quality Synthetic Dataset Paper • 2510.15742 • Published Oct 17, 2025 • 51
Time Is a Feature: Exploiting Temporal Dynamics in Diffusion Language Models Paper • 2508.09138 • Published Aug 12, 2025 • 37
ScaleLSD: Scalable Deep Line Segment Detection Streamlined Paper • 2506.09369 • Published Jun 11, 2025 • 1
FLARE: Feed-forward Geometry, Appearance and Camera Estimation from Uncalibrated Sparse Views Paper • 2502.12138 • Published Feb 17, 2025
Level-S$^2$fM: Structure from Motion on Neural Level Set of Implicit Surfaces Paper • 2211.12018 • Published Nov 22, 2022 • 1