OmegaUse: Building a General-Purpose GUI Agent for Autonomous Task Execution Paper • 2601.20380 • Published 1 day ago • 3 • 1
Agentic Rubrics as Contextual Verifiers for SWE Agents Paper • 2601.04171 • Published 22 days ago • 11 • 2
Klear: Unified Multi-Task Audio-Video Joint Generation Paper • 2601.04151 • Published 22 days ago • 16 • 3
VINO: A Unified Visual Generator with Interleaved OmniModal Context Paper • 2601.02358 • Published 24 days ago • 29 • 3
Bridging Your Imagination with Audio-Video Generation via a Unified Director Paper • 2512.23222 • Published Dec 29, 2025 • 6 • 3
Seed-Prover 1.5: Mastering Undergraduate-Level Theorem Proving via Learning from Experience Paper • 2512.17260 • Published Dec 19, 2025 • 50 • 3
VideoVLA: Video Generators Can Be Generalizable Robot Manipulators Paper • 2512.06963 • Published Dec 7, 2025 • 4 • 2
SpaceControl: Introducing Test-Time Spatial Control to 3D Generative Modeling Paper • 2512.05343 • Published Dec 5, 2025 • 25 • 2
ProPhy: Progressive Physical Alignment for Dynamic World Simulation Paper • 2512.05564 • Published Dec 5, 2025 • 6 • 2
EditThinker: Unlocking Iterative Reasoning for Any Image Editor Paper • 2512.05965 • Published Dec 5, 2025 • 38 • 3
Self-Improving VLM Judges Without Human Annotations Paper • 2512.05145 • Published Dec 2, 2025 • 20 • 2
World Models That Know When They Don't Know: Controllable Video Generation with Calibrated Uncertainty Paper • 2512.05927 • Published Dec 5, 2025 • 12 • 2
SIMA 2: A Generalist Embodied Agent for Virtual Worlds Paper • 2512.04797 • Published Dec 4, 2025 • 25 • 2