LTX-2: Efficient Joint Audio-Visual Foundation Model Paper • 2601.03233 • Published 21 days ago • 135
EasyV2V: A High-quality Instruction-based Video Editing Framework Paper • 2512.16920 • Published Dec 18, 2025 • 18
CRISP: Contact-Guided Real2Sim from Monocular Video with Planar Scene Primitives Paper • 2512.14696 • Published Dec 16, 2025 • 8
FastHMR: Accelerating Human Mesh Recovery via Token and Layer Merging with Diffusion Decoding Paper • 2510.10868 • Published Oct 13, 2025 • 12
PickStyle: Video-to-Video Style Transfer with Context-Style Adapters Paper • 2510.07546 • Published Oct 8, 2025 • 22
MotionAGFormer: Enhancing 3D Human Pose Estimation with a Transformer-GCNFormer Network Paper • 2310.16288 • Published Oct 25, 2023