2 4 18

Chenxiao Zhao

ChenShawn

ChenShawn

AI & ML interests

Reinforcement learning

Recent Activity

upvoted an article about 1 month ago

Forge: Scalable Agent RL Framework and Algorithm

liked a dataset about 2 months ago

nvidia/Nemotron-Terminal-Corpus

liked a dataset about 2 months ago

stepfun-ai/Step-3.5-Flash-SFT

View all activity

Organizations

None yet

upvoted an article about 1 month ago

Article

Forge: Scalable Agent RL Framework and Algorithm

Feb 13

•

151

liked 3 datasets about 2 months ago

authored a paper about 2 months ago

Bootstrapping Exploration with Group-Level Natural Language Feedback in Reinforcement Learning

Paper • 2603.04597 • Published Mar 4 • 210

liked a dataset 2 months ago

zai-org/terminal-bench-2-verified

Updated Feb 27 • 2.57k • 71

authored 4 papers 3 months ago

REDSearcher: A Scalable and Cost-Efficient Framework for Long-Horizon Search Agents

Paper • 2602.14234 • Published Feb 15 • 28

VESPO: Variational Sequence-Level Soft Policy Optimization for Stable Off-Policy LLM Training

Paper • 2602.10693 • Published Feb 11 • 220

DeepEyesV2: Toward Agentic Multimodal Model

Paper • 2511.05271 • Published Nov 7, 2025 • 46

DeepEyes: Incentivizing "Thinking with Images" via Reinforcement Learning

Paper • 2505.14362 • Published May 20, 2025 • 5

upvoted 3 papers 3 months ago

REDSearcher: A Scalable and Cost-Efficient Framework for Long-Horizon Search Agents

Paper • 2602.14234 • Published Feb 15 • 28

VESPO: Variational Sequence-Level Soft Policy Optimization for Stable Off-Policy LLM Training

Paper • 2602.10693 • Published Feb 11 • 220

DeepEyes: Incentivizing "Thinking with Images" via Reinforcement Learning

Paper • 2505.14362 • Published May 20, 2025 • 5

liked a dataset 5 months ago

R2E-Gym/R2E-Gym-Subset

Viewer • Updated Apr 11, 2025 • 4.58k • 2.87k • 25

liked a model 5 months ago

Qwen/Qwen3-30B-A3B-Base

Text Generation • 31B • Updated Jul 26, 2025 • 76.9k • 70

liked 2 datasets 6 months ago

nex-agi/agent-sft

Preview • Updated Dec 9, 2025 • 482 • 106

princeton-nlp/SWE-bench

Viewer • Updated Mar 3, 2025 • 21.5k • 28.1k • 135

liked a dataset 7 months ago

milashkaarshif/MoeGirlPedia_wikitext_raw_archive

Viewer • Updated Feb 11 • 381k • 141 • 38

liked a dataset 8 months ago

BytedTsinghua-SIA/DAPO-Math-17k

Viewer • Updated Apr 18, 2025 • 1.79M • 11k • 170

updated a model 8 months ago

ChenShawn/DeepEyes-rebuttal-model

8B • Updated Sep 19, 2025 • 3 • 1

Chenxiao Zhao

AI & ML interests

Recent Activity

Organizations

ChenShawn's activity

Forge: Scalable Agent RL Framework and Algorithm