Yuhang Zang's picture

Building on HF

Yuhang Zang PRO

yuhangzang

·

https://yuhangzang.github.io/

AI & ML interests

🤗 HuggingFace is all you need

Recent Activity

liked a model about 7 hours ago

internlm/ETCHR-FLUX.2-klein-9B

updated a dataset about 7 hours ago

internlm/CapRL-Video-178K

updated a dataset about 8 hours ago

internlm/DL3DV-2k

View all activity

Organizations

upvoted a collection 8 days ago

Intern-S2

2 items • Updated 8 days ago • 21

upvoted a paper 8 days ago

WildClawBench: A Benchmark for Real-World, Long-Horizon Agent Evaluation

Paper • 2605.10912 • Published 12 days ago • 45

upvoted a collection about 2 months ago

Intern-S1

10 items • Updated Mar 27 • 33

upvoted a paper about 2 months ago

Intern-S1-Pro: Scientific Multimodal Foundation Model at Trillion Scale

Paper • 2603.25040 • Published Mar 26 • 133

upvoted 3 papers 2 months ago

From Sparse to Dense: Multi-View GRPO for Flow Models via Augmented Condition Space

Paper • 2603.12648 • Published Mar 13 • 14

Visual-ERM: Reward Modeling for Visual Equivalence

Paper • 2603.13224 • Published Mar 13 • 21

EndoCoT: Scaling Endogenous Chain-of-Thought Reasoning in Diffusion Models

Paper • 2603.12252 • Published Mar 12 • 12

upvoted 2 papers 3 months ago

MOVA: Towards Scalable and Synchronized Video-Audio Generation

Paper • 2602.08794 • Published Feb 9 • 159

Demo-ICL: In-Context Learning for Procedural Video Knowledge Acquisition

Paper • 2602.08439 • Published Feb 9 • 28

upvoted 2 papers 4 months ago

Unified Personalized Reward Model for Vision Generation

Paper • 2602.02380 • Published Feb 2 • 20

FourierSampler: Unlocking Non-Autoregressive Potential in Diffusion Language Models via Frequency-Guided Generation

Paper • 2601.23182 • Published Jan 30 • 21

upvoted 2 collections 4 months ago

UnifiedReward Flex

12 items • Updated about 17 hours ago • 6

LightOnOCR-2 🦉

LightOnOCR-2-1B: a lightweight high-performance end-to-end OCR model family • 12 items • Updated Apr 7 • 24

upvoted a collection 5 months ago

RoPE++

19 items • Updated Dec 9, 2025 • 2

upvoted a paper 5 months ago

Beyond Real: Imaginary Extension of Rotary Position Embeddings for Long-Context LLMs

Paper • 2512.07525 • Published Dec 8, 2025 • 60

upvoted 3 papers 6 months ago

ARM-Thinker: Reinforcing Multimodal Generative Reward Models with Agentic Tool Use and Visual Reasoning

Paper • 2512.05111 • Published Dec 4, 2025 • 50

ViSAudio: End-to-End Video-Driven Binaural Spatial Audio Generation

Paper • 2512.03036 • Published Dec 2, 2025 • 22

Think Visually, Reason Textually: Vision-Language Synergy in ARC

Paper • 2511.15703 • Published Nov 19, 2025 • 9

upvoted 2 papers 7 months ago

Spatial-SSRL: Enhancing Spatial Understanding via Self-Supervised Reinforcement Learning

Paper • 2510.27606 • Published Oct 31, 2025 • 31

STAR-Bench: Probing Deep Spatio-Temporal Reasoning as Audio 4D Intelligence

Paper • 2510.24693 • Published Oct 28, 2025 • 19