35 163

zhang

cheng1

zebrajack

AI & ML interests

CV, NLP. RL

Recent Activity

liked a model about 7 hours ago

unsloth/gemma-4-12b-it-GGUF

liked a model 2 days ago

baidu/Unlimited-OCR

liked a Space 4 days ago

facebook/vggt-omega

View all activity

Organizations

None yet

upvoted a paper 10 days ago

Visual Document Understanding and Question Answering: A Multi-Agent Collaboration Framework with Test-Time Scaling

Paper • 2508.03404 • Published Aug 5, 2025 • 5

upvoted a paper about 2 months ago

HY-World 2.0: A Multi-Modal World Model for Reconstructing, Generating, and Simulating 3D Worlds

Paper • 2604.14268 • Published Apr 15 • 127

upvoted a paper 2 months ago

Pretrained Video Models as Differentiable Physics Simulators for Urban Wind Flows

Paper • 2603.21210 • Published Mar 22 • 1

upvoted a collection 3 months ago

💧 LFM2.5

Collection

Collection of post-trained and base LFM2.5 models. • 12 items • Updated 11 days ago • 158

upvoted an article 5 months ago

Article

Introducing OptiMind, a research model designed for optimization

microsoft

•

Jan 15

• 35

upvoted 5 papers 9 months ago

FastViDAR: Real-Time Omnidirectional Depth Estimation via Alternative Hierarchical Attention

Paper • 2509.23733 • Published Sep 28, 2025 • 1

DA^2: Depth Anything in Any Direction

Paper • 2509.26618 • Published Sep 30, 2025 • 26

OmniPart: Part-Aware 3D Generation with Semantic Decoupling and Structural Cohesion

Paper • 2507.06165 • Published Jul 8, 2025 • 60

LL3M: Large Language 3D Modelers

Paper • 2508.08228 • Published Aug 11, 2025 • 2

HPR3D: Hierarchical Proxy Representation for High-Fidelity 3D Reconstruction and Controllable Editing

Paper • 2507.11971 • Published Jul 16, 2025 • 1

upvoted an article 10 months ago

Article

Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM

ariG23498, merve, pcuenq, reach-vb

•

Mar 12, 2025

• 497

upvoted a paper 10 months ago

Timer-XL: Long-Context Transformers for Unified Time Series Forecasting

Paper • 2410.04803 • Published Oct 7, 2024 • 2

upvoted an article about 1 year ago

Article

Introducing smolagents: simple agents that write actions in code.

m-ric, merve, thomwolf

•

Dec 31, 2024

• 1.2k

upvoted a paper about 1 year ago

PE3R: Perception-Efficient 3D Reconstruction

Paper • 2503.07507 • Published Mar 10, 2025 • 10

upvoted an article about 1 year ago

Article

SmolVLM Grows Smaller – Introducing the 256M & 500M Models!

andito, mfarre, merve

•

Jan 23, 2025

• 192

upvoted a paper almost 2 years ago

MemoRAG: Moving towards Next-Gen RAG Via Memory-Inspired Knowledge Discovery

Paper • 2409.05591 • Published Sep 9, 2024 • 31

upvoted 4 papers about 2 years ago

zhang

AI & ML interests

Recent Activity

Organizations

cheng1's activity

Introducing OptiMind, a research model designed for optimization

Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM

Introducing smolagents: simple agents that write actions in code.

SmolVLM Grows Smaller – Introducing the 256M & 500M Models!