Building a Precise Video Language with Human-AI Oversight Paper • 2604.21718 • Published 26 days ago • 17
Video Analysis and Generation via a Semantic Progress Function Paper • 2604.22554 • Published 24 days ago • 63
CoInteract: Physically-Consistent Human-Object Interaction Video Synthesis via Spatially-Structured Co-Generation Paper • 2604.19636 • Published 27 days ago • 87
Strips as Tokens: Artist Mesh Generation with Native UV Segmentation Paper • 2604.09132 • Published Apr 10 • 55
DeltaTok Collection DeltaTok tokenizer, DeltaWorld predictor, and evaluation heads. https://github.com/amazon-far/deltatok • 7 items • Updated Apr 8 • 8
TC-AE: Unlocking Token Capacity for Deep Compression Autoencoders Paper • 2604.07340 • Published Apr 8 • 17
InCoder-32B-Thinking: Industrial Code World Model for Thinking Paper • 2604.03144 • Published Apr 3 • 233
OmniLottie: Generating Vector Animations via Parameterized Lottie Tokens Paper • 2603.02138 • Published Mar 2 • 151
A Style is Worth One Code: Unlocking Code-to-Style Image Generation with Discrete Style Space Paper • 2511.10555 • Published Nov 13, 2025 • 63
Kandinsky 5.0: A Family of Foundation Models for Image and Video Generation Paper • 2511.14993 • Published Nov 19, 2025 • 233
UltraHR-100K: Enhancing UHR Image Synthesis with A Large-Scale High-Quality Dataset Paper • 2510.20661 • Published Oct 23, 2025 • 16
Kling-Avatar: Grounding Multimodal Instructions for Cascaded Long-Duration Avatar Animation Synthesis Paper • 2509.09595 • Published Sep 11, 2025 • 48
HuMo: Human-Centric Video Generation via Collaborative Multi-Modal Conditioning Paper • 2509.08519 • Published Sep 10, 2025 • 130
X-Part: high fidelity and structure coherent shape decomposition Paper • 2509.08643 • Published Sep 10, 2025 • 28