Motion 3-to-4: 3D Motion Reconstruction for 4D Synthesis Paper β’ 2601.14253 β’ Published 8 days ago β’ 9
V-DPM: 4D Video Reconstruction with Dynamic Point Maps Paper β’ 2601.09499 β’ Published 14 days ago β’ 9
UM-Text: A Unified Multimodal Model for Image Understanding Paper β’ 2601.08321 β’ Published 15 days ago β’ 8
ResTok: Learning Hierarchical Residuals in 1D Visual Tokenizers for Autoregressive Image Generation Paper β’ 2601.03955 β’ Published 21 days ago β’ 3
FlowBlending: Stage-Aware Multi-Model Sampling for Fast and High-Fidelity Video Generation Paper β’ 2512.24724 β’ Published 28 days ago β’ 7
Dream2Flow: Bridging Video Generation and Open-World Manipulation with 3D Object Flow Paper β’ 2512.24766 β’ Published 28 days ago β’ 9
Self-Evaluation Unlocks Any-Step Text-to-Image Generation Paper β’ 2512.22374 β’ Published Dec 26, 2025 β’ 17
What matters for Representation Alignment: Global Information or Spatial Structure? Paper β’ 2512.10794 β’ Published Dec 11, 2025 β’ 9
MoCapAnything: Unified 3D Motion Capture for Arbitrary Skeletons from Monocular Videos Paper β’ 2512.10881 β’ Published Dec 11, 2025 β’ 29
ThreadWeaver: Adaptive Threading for Efficient Parallel Reasoning in Language Models Paper β’ 2512.07843 β’ Published Nov 24, 2025 β’ 22
SyncMV4D: Synchronized Multi-view Joint Diffusion of Appearance and Motion for Hand-Object Interaction Synthesis Paper β’ 2511.19319 β’ Published Nov 24, 2025 β’ 2
BigCodeArena: Unveiling More Reliable Human Preferences in Code Generation via Execution Paper β’ 2510.08697 β’ Published Oct 9, 2025 β’ 37
TTT3R: 3D Reconstruction as Test-Time Training Paper β’ 2509.26645 β’ Published Sep 30, 2025 β’ 15
UP2You: Fast Reconstruction of Yourself from Unconstrained Photo Collections Paper β’ 2509.24817 β’ Published Sep 29, 2025 β’ 9
See, Point, Fly: A Learning-Free VLM Framework for Universal Unmanned Aerial Navigation Paper β’ 2509.22653 β’ Published Sep 26, 2025 β’ 25
view post Post 599 Qwen 3 Coder is a personal attack to k2, and I love it.It achieves near SOTA on LCB while not having reasoning.Finally people are understanding that reasoning isnt necessary for high benches...Qwen ftw!DECENTRALIZE DECENTRALIZE DECENTRALIZE See translation π 6 6 π₯ 4 4 + Reply
Towards Video Thinking Test: A Holistic Benchmark for Advanced Video Reasoning and Understanding Paper β’ 2507.15028 β’ Published Jul 20, 2025 β’ 21