UniT: Toward a Unified Physical Language for Human-to-Humanoid Policy Learning and World Modeling Paper • 2604.19734 • Published 15 days ago • 29
ARC-Hunyuan-Video-7B: Structured Video Comprehension of Real-World Shorts Paper • 2507.20939 • Published Jul 28, 2025 • 57
GRPO-CARE: Consistency-Aware Reinforcement Learning for Multimodal Reasoning Paper • 2506.16141 • Published Jun 19, 2025 • 27
AnimeShooter: A Multi-Shot Animation Dataset for Reference-Guided Video Generation Paper • 2506.03126 • Published Jun 3, 2025 • 22
Video-Holmes: Can MLLM Think Like Holmes for Complex Video Reasoning? Paper • 2505.21374 • Published May 27, 2025 • 28
AnimeGamer: Infinite Anime Life Simulation with Next Game State Prediction Paper • 2504.01014 • Published Apr 1, 2025 • 70
Exploring the Effect of Reinforcement Learning on Video Understanding: Insights from SEED-Bench-R1 Paper • 2503.24376 • Published Mar 31, 2025 • 38
GenHancer: Imperfect Generative Models are Secretly Strong Vision-Centric Enhancers Paper • 2503.19480 • Published Mar 25, 2025 • 16
Divot: Diffusion Powers Video Tokenizer for Comprehension and Generation Paper • 2412.04432 • Published Dec 5, 2024 • 16
Moto: Latent Motion Token as the Bridging Language for Robot Manipulation Paper • 2412.04445 • Published Dec 5, 2024 • 22
Open-MAGVIT2: An Open-Source Project Toward Democratizing Auto-regressive Visual Generation Paper • 2409.04410 • Published Sep 6, 2024 • 25