Kevin Zhang

Kevin-thu

https://kevin-thu.github.io/homepage

AI & ML interests

Computer Vision, Generation Models, Neural Rendering

Recent Activity

upvoted a paper about 21 hours ago

DanceOPD: On-Policy Generative Field Distillation

upvoted a paper 11 days ago

PermaVid: Consistent Video Generation Across Edits via Disentangled Context Memory

upvoted a paper 11 days ago

Memento: Reconstruct to Remember for Consistent Long Video Generation

Organizations

None yet

upvoted a paper about 21 hours ago

DanceOPD: On-Policy Generative Field Distillation

Paper • 2606.27377 • Published 2 days ago • 57

upvoted 2 papers 11 days ago

PermaVid: Consistent Video Generation Across Edits via Disentangled Context Memory

Paper • 2606.16449 • Published 12 days ago • 5

Memento: Reconstruct to Remember for Consistent Long Video Generation

Paper • 2606.14667 • Published 15 days ago • 17

upvoted a paper 12 days ago

MilliVid: Hierarchical Latents for Long-Range Consistency in Video Generation

Paper • 2606.09056 • Published 19 days ago • 6

upvoted 2 papers 19 days ago

Echo-Infinity: Learning Evolving Memory for Real-Time Infinite Video Generation

Paper • 2606.04527 • Published 24 days ago • 28

Cosmos 3: Omnimodal World Models for Physical AI

Paper • 2606.02800 • Published 26 days ago • 136

upvoted a paper 25 days ago

Gamma-World: Generative Multi-Agent World Modeling Beyond Two Players

Paper • 2605.28816 • Published May 27 • 431

upvoted 4 papers about 1 month ago

PiD: Fast and High-Resolution Latent Decoding with Pixel Diffusion

Paper • 2605.23902 • Published May 22 • 46

SpatialBench: Is Your Spatial Foundation Model an All-Round Player?

Paper • 2605.27367 • Published May 26 • 72

Mega-ASR: Towards In-the-wild^2 Speech Recognition via Scaling up Real-world Acoustic Simulation

Paper • 2605.19833 • Published May 19 • 137

Flow-OPD: On-Policy Distillation for Flow Matching Models

Paper • 2605.08063 • Published May 8 • 102

upvoted a paper about 2 months ago

Video Analysis and Generation via a Semantic Progress Function

Paper • 2604.22554 • Published Apr 24 • 64

upvoted 2 papers 2 months ago

Seedance 2.0: Advancing Video Generation for World Complexity

Paper • 2604.14148 • Published Apr 15 • 167

Lyra 2.0: Explorable Generative 3D Worlds

Paper • 2604.13036 • Published Apr 14 • 41

upvoted 6 papers 3 months ago

Self-Distilled RLVR

Paper • 2604.03128 • Published Apr 3 • 179

OpenWorldLib: A Unified Codebase and Definition of Advanced World Models

Paper • 2604.04707 • Published Apr 6 • 204

Astrolabe: Steering Forward-Process Reinforcement Learning for Distilled Autoregressive Video Models

Paper • 2603.17051 • Published Mar 17 • 109

WorldCam: Interactive Autoregressive 3D Gaming Worlds with Camera Pose as a Unifying Geometric Representation

Paper • 2603.16871 • Published Mar 17 • 61

Demystifing Video Reasoning

Paper • 2603.16870 • Published Mar 17 • 373

Attention Residuals

Paper • 2603.15031 • Published Mar 16 • 189
