Zilong Huang's picture

Zilong Huang

speedinghzl

·

speedinghzl

AI & ML interests

None yet

Recent Activity

upvoted a collection 2 days ago

authored a paper about 2 months ago

Traceable Evidence Enhanced Visual Grounded Reasoning: Evaluation and Methodology

authored a paper about 2 months ago

DenseWorld-1M: Towards Detailed Dense Grounded Caption in the Real World

View all activity

Organizations

None yet

upvoted a collection 2 days ago

GenLIP

Model weights of paper "Let ViT Speak: Generative Language-Image Pre-training" • 6 items • Updated May 5 • 8

authored 10 papers about 2 months ago

Traceable Evidence Enhanced Visual Grounded Reasoning: Evaluation and Methodology

Paper • 2507.07999 • Published Jul 10, 2025 • 51

DenseWorld-1M: Towards Detailed Dense Grounded Caption in the Real World

Paper • 2506.24102 • Published Jun 30, 2025 • 1

ThinkGen: Generalized Thinking for Visual Generation

Paper • 2512.23568 • Published Dec 29, 2025 • 1

Agriculture-Vision: A Large Aerial Image Database for Agricultural Pattern Analysis

Paper • 2001.01306 • Published Jan 5, 2020

Grasp Any Region: Towards Precise, Contextual Pixel Understanding for Multimodal LLMs

Paper • 2510.18876 • Published Oct 21, 2025 • 37

TopFormer: Token Pyramid Transformer for Mobile Semantic Segmentation

Paper • 2204.05525 • Published Apr 12, 2022

CodeDance: A Dynamic Tool-integrated MLLM for Executable Visual Reasoning

Paper • 2512.17312 • Published Dec 19, 2025 • 3

EVATok: Adaptive Length Video Tokenization for Efficient Visual Autoregressive Generation

Paper • 2603.12267 • Published Mar 12 • 13

Mixture-of-Depths Attention

Paper • 2603.15619 • Published Mar 16 • 81

Let ViT Speak: Generative Language-Image Pre-training

Paper • 2605.00809 • Published May 1 • 33

upvoted 2 papers about 2 months ago

CodeDance: A Dynamic Tool-integrated MLLM for Executable Visual Reasoning

Paper • 2512.17312 • Published Dec 19, 2025 • 3

Let ViT Speak: Generative Language-Image Pre-training

Paper • 2605.00809 • Published May 1 • 33

liked a model about 2 months ago

hustvl/SuperCLIP

Updated Dec 24, 2025 • 3

upvoted a collection 3 months ago

Qwen3.5

21 items • Updated Mar 9 • 1.69k

liked a Space 7 months ago

Depth Anything 3

Generate depth maps from your photos

liked a dataset 8 months ago

ServiceNow/GroundCUA

Preview • Updated Dec 24, 2025 • 52.2k • 37

updated a model 11 months ago

speedinghzl/Superclass

Updated Jul 31, 2025 • 1

liked 2 datasets about 1 year ago

yangjie-cv/WeThink_Multimodal_Reasoning_120K

Viewer • Updated Jun 10, 2025 • 126k • 44 • 9

Lixsp11/Sekai-Project

Viewer • Updated Oct 22, 2025 • 344k • 1.09k • 43