2 7 11

Zhuoyi Yang

keg-yzy

AI & ML interests

None yet

Recent Activity

upvoted a paper 1 day ago

SCAIL-2: Unifying Controlled Character Animation with End-to-end In-Context Conditioning

upvoted a paper 20 days ago

Mega-ASR: Towards In-the-wild^2 Speech Recognition via Scaling up Real-world Acoustic Simulation

liked a model 5 months ago

zai-org/GLM-Image

View all activity

Organizations

None yet

upvoted a paper 1 day ago

SCAIL-2: Unifying Controlled Character Animation with End-to-end In-Context Conditioning

Paper • 2606.10804 • Published 2 days ago • 32

upvoted a paper 20 days ago

Mega-ASR: Towards In-the-wild^2 Speech Recognition via Scaling up Real-world Acoustic Simulation

Paper • 2605.19833 • Published 23 days ago • 134

liked a model 5 months ago

zai-org/GLM-Image

Text-to-Image • Updated Jan 15 • 7.93k • • 1.07k

liked a model 6 months ago

zai-org/SCAIL-Preview

Updated Dec 16, 2025 • 111

upvoted a paper 6 months ago

SCAIL: Towards Studio-Grade Character Animation via In-Context Learning of 3D-Consistent Pose Representations

Paper • 2512.05905 • Published Dec 5, 2025 • 21

liked a model 8 months ago

zai-org/Kaleido-14B-S2V

Updated Dec 11, 2025 • 19

liked a model over 1 year ago

zai-org/CogView4-6B

Text-to-Image • Updated Mar 11, 2025 • 2.77k • • 254

upvoted a paper over 1 year ago

MotionBench: Benchmarking and Improving Fine-grained Video Motion Understanding for Vision Language Models

Paper • 2501.02955 • Published Jan 6, 2025 • 44

liked a Space over 1 year ago

MotionBench Leaderboard

🐨

Submit and view leaderboard data for model evaluations

liked 3 models over 1 year ago

upvoted a collection almost 2 years ago

CogVideo

Collection

10 items • Updated Jun 30, 2025 • 64

authored 2 papers almost 2 years ago

CogVideoX: Text-to-Video Diffusion Models with An Expert Transformer

Paper • 2408.06072 • Published Aug 12, 2024 • 38

CogVLM: Visual Expert for Pretrained Language Models

Paper • 2311.03079 • Published Nov 6, 2023 • 28

liked a Space almost 2 years ago

CogVideoX-5B

🎥

1.04k

Text-to-Video

liked a model almost 2 years ago

zai-org/CogVideoX-5b

Text-to-Video • Updated Nov 23, 2024 • 16.7k • • 675

upvoted a paper almost 2 years ago

CogVideoX: Text-to-Video Diffusion Models with An Expert Transformer

Paper • 2408.06072 • Published Aug 12, 2024 • 38

liked a model almost 2 years ago

zai-org/CogVideoX-2b

Text-to-Video • Updated Nov 23, 2024 • 21.9k • 363