Ho Kei Cheng PRO

hkchengrex

27 26 8

https://hkchengrex.com/

AI & ML interests

None yet

Recent Activity

liked a dataset 16 days ago

XDOF/ABC-130k

upvoted a paper about 1 month ago

LocateAnything: Fast and High-Quality Vision-Language Grounding with Parallel Box Decoding

new activity about 2 months ago

hkchengrex/MMAudio:MMAudio Demo Data Handling/Retention?

upvoted a paper about 1 month ago

LocateAnything: Fast and High-Quality Vision-Language Grounding with Parallel Box Decoding

Paper • 2605.27365 • Published May 26 • 145

upvoted a paper 2 months ago

Video Analysis and Generation via a Semantic Progress Function

Paper • 2604.22554 • Published Apr 24 • 64

upvoted 4 papers 3 months ago

WildDet3D: Scaling Promptable 3D Detection in the Wild

Paper • 2604.08626 • Published Apr 9 • 248

ImagenWorld: Stress-Testing Image Generation Models with Explainable Human Evaluation on Open-ended Real-World Tasks

Paper • 2603.27862 • Published Mar 29 • 33

UniGRPO: Unified Policy Optimization for Reasoning-Driven Visual Generation

Paper • 2603.23500 • Published Mar 24 • 37

Speed by Simplicity: A Single-Stream Architecture for Fast Audio-Video Generative Foundation Model

Paper • 2603.21986 • Published Mar 23 • 125

upvoted 2 papers 5 months ago

VisPhyWorld: Probing Physical Reasoning via Code-Driven Video Reconstruction

Paper • 2602.13294 • Published Feb 9 • 13

VideoMaMa: Mask-Guided Video Matting via Generative Prior

Paper • 2601.14255 • Published Jan 20 • 15

upvoted 2 papers 6 months ago

SAM Audio: Segment Anything in Audio

Paper • 2512.18099 • Published Dec 19, 2025 • 25

4D-RGPT: Toward Region-level 4D Understanding via Perceptual Distillation

Paper • 2512.17012 • Published Dec 18, 2025 • 49

upvoted a paper 7 months ago

SAM 3: Segment Anything with Concepts

Paper • 2511.16719 • Published Nov 20, 2025 • 138

upvoted a paper 8 months ago

SAM 3D: 3Dfy Anything in Images

Paper • 2511.16624 • Published Nov 20, 2025 • 117

upvoted a paper 9 months ago

Diffusion Transformers with Representation Autoencoders

Paper • 2510.11690 • Published Oct 13, 2025 • 171

upvoted 4 papers about 1 year ago

The Diffusion Duality

Paper • 2506.10892 • Published Jun 12, 2025 • 37

Perception Encoder: The best visual embeddings are not at the output of the network

Paper • 2504.13181 • Published Apr 17, 2025 • 37

Packing Input Frame Context in Next-Frame Prediction Models for Video Generation

Paper • 2504.12626 • Published Apr 17, 2025 • 51

Gaussian Mixture Flow Matching Models

Paper • 2504.05304 • Published Apr 7, 2025 • 11

upvoted 3 papers over 1 year ago

CFG-Zero*: Improved Classifier-Free Guidance for Flow Matching Models

Paper • 2503.18886 • Published Mar 24, 2025 • 24

Tokenize Image as a Set

Paper • 2503.16425 • Published Mar 20, 2025 • 16

The Curse of Conditions: Analyzing and Improving Optimal Transport for Conditional Flow-Based Generation

Paper • 2503.10636 • Published Mar 13, 2025 • 3
