7 15

haiyimei

AI & ML interests

None yet

Recent Activity

upvoted a paper about 2 months ago

UniVidX: A Unified Multimodal Framework for Versatile Video Generation via Diffusion Priors

upvoted a collection about 2 months ago

SenseNova-U1

liked a model 3 months ago

google/gemma-4-31B-it

View all activity

Organizations

upvoted a paper about 2 months ago

UniVidX: A Unified Multimodal Framework for Versatile Video Generation via Diffusion Priors

Paper • 2605.00658 • Published May 1 • 86

upvoted a collection about 2 months ago

SenseNova-U1

Collection

SenseNova-U1: Unifying Multimodal Understanding and Generation with NEO-Unify Architecture • 10 items • Updated 17 days ago • 74

liked 2 models 3 months ago

google/gemma-4-31B-it

Image-Text-to-Text • 33B • Updated 26 days ago • 11M • • 3.08k

dealignai/Gemma-4-31B-JANG_4M-CRACK

Image-Text-to-Text • 6B • Updated Apr 25 • 41.7k • 1.67k

upvoted 2 papers 3 months ago

Bridging Semantic and Kinematic Conditions with Diffusion-based Discrete Motion Tokenizer

Paper • 2603.19227 • Published Mar 19 • 42

Demystifing Video Reasoning

Paper • 2603.16870 • Published Mar 17 • 373

upvoted a paper 4 months ago

A Very Big Video Reasoning Suite

Paper • 2602.20159 • Published Feb 23 • 526

liked a Space 5 months ago

Qwen3-TTS Demo

🎙

Generate speech from text using voice design, cloning or presets

upvoted a paper 8 months ago

The Quest for Generalizable Motion Generation: Data, Model, and Evaluation

Paper • 2510.26794 • Published Oct 30, 2025 • 27

liked a model 10 months ago

openbmb/MiniCPM-V-4_5

Image-Text-to-Text • 9B • Updated Mar 10 • 86.7k • 1.09k

liked a Space 10 months ago

FastVLM WebGPU

🍎

446

Real-time video captioning powered by FastVLM

liked a model about 1 year ago

sand-ai/MAGI-1

Image-to-Video • Updated 13 days ago • 609

liked a dataset about 1 year ago

caizhongang/SynBody

Updated Nov 4, 2024 • 333 • 6

authored a paper over 1 year ago

WHAC: World-grounded Humans and Cameras

Paper • 2403.12959 • Published Mar 19, 2024 • 4

upvoted an article over 1 year ago

Article

Open-source DeepResearch – Freeing our search agents

m-ric, albertvillanova, merve, thomwolf, clefourrier

•

Feb 4, 2025

• 1.32k

liked 4 models over 1 year ago

liked a model about 2 years ago

stabilityai/stable-diffusion-3-medium

Text-to-Image • Updated Aug 12, 2024 • 3.48k • • 4.98k