SCAIL-2: Unifying Controlled Character Animation with End-to-end In-Context Conditioning Paper • 2606.10804 • Published 2 days ago • 32
Mega-ASR: Towards In-the-wild^2 Speech Recognition via Scaling up Real-world Acoustic Simulation Paper • 2605.19833 • Published 23 days ago • 134
SCAIL: Towards Studio-Grade Character Animation via In-Context Learning of 3D-Consistent Pose Representations Paper • 2512.05905 • Published Dec 5, 2025 • 21
MotionBench: Benchmarking and Improving Fine-grained Video Motion Understanding for Vision Language Models Paper • 2501.02955 • Published Jan 6, 2025 • 44
Build error Agents 11 MotionBench Leaderboard 🐨 11 Submit and view leaderboard data for model evaluations
CogVideoX: Text-to-Video Diffusion Models with An Expert Transformer Paper • 2408.06072 • Published Aug 12, 2024 • 38
CogVideoX: Text-to-Video Diffusion Models with An Expert Transformer Paper • 2408.06072 • Published Aug 12, 2024 • 38
Inf-DiT: Upsampling Any-Resolution Image with Memory-Efficient Diffusion Transformer Paper • 2405.04312 • Published May 7, 2024 • 1