1 66 3

Li

kotion

AI & ML interests

AI,AIGC

Recent Activity

upvoted a paper about 1 month ago

CutVerse: A Compositional GUI Agents Benchmark for Media Post-Production Editing

upvoted a paper about 1 month ago

Flow-OPD: On-Policy Distillation for Flow Matching Models

upvoted a paper about 2 months ago

World-R1: Reinforcing 3D Constraints for Text-to-Video Generation

View all activity

Organizations

upvoted 2 papers about 1 month ago

CutVerse: A Compositional GUI Agents Benchmark for Media Post-Production Editing

Paper • 2605.19484 • Published May 19 • 21

Flow-OPD: On-Policy Distillation for Flow Matching Models

Paper • 2605.08063 • Published May 8 • 102

upvoted a paper about 2 months ago

World-R1: Reinforcing 3D Constraints for Text-to-Video Generation

Paper • 2604.24764 • Published Apr 27 • 119

upvoted 8 papers 2 months ago

FlowAnchor: Stabilizing the Editing Signal for Inversion-Free Video Editing

Paper • 2604.22586 • Published Apr 24 • 16

HY-World 2.0: A Multi-Modal World Model for Reconstructing, Generating, and Simulating 3D Worlds

Paper • 2604.14268 • Published Apr 15 • 127

Matrix-Game 3.0: Real-Time and Streaming Interactive World Model with Long-Horizon Memory

Paper • 2604.08995 • Published Apr 10 • 51

Prompt Relay: Inference-Time Temporal Control for Multi-Event Video Generation

Paper • 2604.10030 • Published Apr 11 • 15

Uni-ViGU: Towards Unified Video Generation and Understanding via A Diffusion-Based Video Generator

Paper • 2604.08121 • Published Apr 9 • 44

OmniShow: Unifying Multimodal Conditions for Human-Object Interaction Video Generation

Paper • 2604.11804 • Published Apr 13 • 72

Attention Sink in Transformers: A Survey on Utilization, Interpretation, and Mitigation

Paper • 2604.10098 • Published Apr 11 • 82

VideoFlexTok: Flexible-Length Coarse-to-Fine Video Tokenization

Paper • 2604.12887 • Published Apr 14 • 5

upvoted 3 papers 6 months ago

UniCorn: Towards Self-Improving Unified Multimodal Models through Self-Generated Supervision

Paper • 2601.03193 • Published Jan 6 • 51

Generative Neural Video Compression via Video Diffusion Prior

Paper • 2512.05016 • Published Dec 4, 2025 • 10

IC-Effect: Precise and Efficient Video Effects Editing via In-Context Learning

Paper • 2512.15635 • Published Dec 17, 2025 • 20

upvoted 6 papers 7 months ago

UnityVideo: Unified Multi-Modal Multi-Task Learning for Enhancing World-Aware Video Generation

Paper • 2512.07831 • Published Dec 8, 2025 • 17

BlockVid: Block Diffusion for High-Quality and Consistent Minute-Long Video Generation

Paper • 2511.22973 • Published Nov 28, 2025 • 7

TV2TV: A Unified Framework for Interleaved Language and Video Generation

Paper • 2512.05103 • Published Dec 4, 2025 • 20

Live Avatar: Streaming Real-time Audio-Driven Avatar Generation with Infinite Length

Paper • 2512.04677 • Published Dec 4, 2025 • 178

Architecture Decoupling Is Not All You Need For Unified Multimodal Model

Paper • 2511.22663 • Published Nov 27, 2025 • 29

Video Generation Models Are Good Latent Reward Models

Paper • 2511.21541 • Published Nov 26, 2025 • 49

Li

AI & ML interests

Recent Activity

Organizations

kotion's activity