
Ho Kei Cheng PRO

hkchengrex

https://hkchengrex.com/


Recent Activity


liked a dataset 9 days ago

XDOF/ABC-130k

Updated 1 day ago • 372k • 60

upvoted a paper about 1 month ago

LocateAnything: Fast and High-Quality Vision-Language Grounding with Parallel Box Decoding

Paper • 2605.27365 • Published May 26 • 144

New activity in hkchengrex/MMAudio about 1 month ago

MMAudio Demo Data Handling/Retention?

#33 opened about 1 month ago by deleted

New activity in hkchengrex/MMAudio about 2 months ago

help, it broke again

#32 opened about 2 months ago by bob5272

updated a Space about 2 months ago

MMAudio — generating synchronized audio from video/text

969

Generate synchronized audio for videos from text prompts

upvoted a paper 2 months ago

Video Analysis and Generation via a Semantic Progress Function

Paper • 2604.22554 • Published Apr 24 • 64

upvoted a paper 3 months ago

WildDet3D: Scaling Promptable 3D Detection in the Wild

Paper • 2604.08626 • Published Apr 9 • 248

New activity in hkchengrex/MMAudio 3 months ago

Fix type annotation

#31 opened 3 months ago by hysts

Why my api can't work

#30 opened 3 months ago by BigfufuOuO

upvoted a paper 3 months ago

ImagenWorld: Stress-Testing Image Generation Models with Explainable Human Evaluation on Open-ended Real-World Tasks

Paper • 2603.27862 • Published Mar 29 • 33

liked a model 3 months ago

facebook/sam3.1

Mask Generation • Updated Mar 27 • 117k • 390

upvoted 2 papers 3 months ago

UniGRPO: Unified Policy Optimization for Reasoning-Driven Visual Generation

Paper • 2603.23500 • Published Mar 24 • 37

Speed by Simplicity: A Single-Stream Architecture for Fast Audio-Video Generative Foundation Model

Paper • 2603.21986 • Published Mar 23 • 125

New activity in hkchengrex/MMAudio 4 months ago

changed the language ..

#28 opened 4 months ago by Retpe

updated a model 4 months ago

hkchengrex/MMAudio

Updated Feb 19 • 126

upvoted a paper 4 months ago

VisPhyWorld: Probing Physical Reasoning via Code-Driven Video Reconstruction

Paper • 2602.13294 • Published Feb 9 • 13

New activity in hkchengrex/MMAudio 5 months ago

help It broke again

#26 opened 5 months ago by bob5272

upvoted a paper 5 months ago

VideoMaMa: Mask-Guided Video Matting via Generative Prior

Paper • 2601.14255 • Published Jan 20 • 15

New activity in hkchengrex/MMAudio 5 months ago

help

#24 opened 5 months ago by jackkyyyys

authored a paper 6 months ago

SAM 3: Segment Anything with Concepts

Paper • 2511.16719 • Published Nov 20, 2025 • 137