Yucheng Zhao

yuchengz

AI & ML interests

None yet

Recent Activity

liked a dataset about 1 month ago

openbmb/UltraData-SFT-2605

liked a dataset 3 months ago

nvidia/Nemotron-Cascade-2-RL-data

liked a model 6 months ago

nvidia/nemotron-speech-streaming-en-0.6b

Organizations

upvoted a paper 7 months ago

Z-Image: An Efficient Image Generation Foundation Model with Single-Stream Diffusion Transformer

Paper • 2511.22699 • Published Nov 27, 2025 • 248

upvoted a paper 10 months ago

Spatial-MLLM: Boosting MLLM Capabilities in Visual-based Spatial Intelligence

Paper • 2505.23747 • Published May 29, 2025 • 69

upvoted an article about 1 year ago

Article

Vision Language Models (Better, faster, stronger)

merve, sergiopaniego, ariG23498, pcuenq, andito • May 12, 2025 • 613

upvoted an article over 1 year ago

Article

The N Implementation Details of RLHF with PPO

vwxyzjn, tianlinliu0121, lvwerra • Oct 24, 2023 • 72

upvoted 2 papers over 1 year ago

O1 Replication Journey -- Part 3: Inference-time Scaling for Medical Reasoning

Paper • 2501.06458 • Published Jan 11, 2025 • 31

2.5 Years in Class: A Multimodal Textbook for Vision-Language Pretraining

Paper • 2501.00958 • Published Jan 1, 2025 • 110

upvoted 2 papers almost 2 years ago

VisualAgentBench: Towards Large Multimodal Models as Visual Foundation Agents

Paper • 2408.06327 • Published Aug 12, 2024 • 17

LongWriter: Unleashing 10,000+ Word Generation from Long Context LLMs

Paper • 2408.07055 • Published Aug 13, 2024 • 69

upvoted a collection almost 2 years ago

NuminaMath

Collection

Datasets and models for training SOTA math LLMs. See our GitHub for training & inference code: https://github.com/project-numina/aimo-progress-prize • 7 items • Updated Feb 10, 2025 • 83

upvoted 2 articles about 2 years ago

Article

Cosmopedia: how to create large-scale synthetic data for pre-training Large Language Models

loubnabnl, anton-l, davanstrien • Mar 20, 2024 • 115

Article

From DeepSpeed to FSDP and Back Again with Hugging Face Accelerate

mirinflim, aldopareja, muellerzr, stas • Jun 13, 2024 • 62
