SII-Jhao Zhang

JingHaoZ

2 19 3

https://jinghaoleven.github.io

AI & ML interests

Large Reasoning Model, Unified Understanding and Generation in MLLM

Recent Activity

upvoted an article about 6 hours ago

Keep the Tokens Flowing: Lessons from 16 Open-Source RL Libraries

upvoted a paper 17 days ago

JoyAI-VL-Interaction: Real-Time Vision-Language Interaction Intelligence

updated a collection 26 days ago

RLVR-Schedule

View all activity

Organizations

upvoted an article about 6 hours ago

Article

Keep the Tokens Flowing: Lessons from 16 Open-Source RL Libraries

aminediroHF, qgallouedec, kashif, lewtun, edbeeching, albertvillanova, nouamanetazi, lvwerra, sergiopaniego

•

Mar 10

• 167

upvoted a paper 17 days ago

JoyAI-VL-Interaction: Real-Time Vision-Language Interaction Intelligence

Paper • 2606.14777 • Published 23 days ago • 208

updated a collection 26 days ago

RLVR-Schedule

Collection

Not only where, But when: Temporal Scheduling for RLVR • 2 items • Updated 26 days ago

authored a paper about 1 month ago

Not only where, But when: Temporal Scheduling for RLVR

Paper • 2605.25381 • Published May 25 • 6

upvoted a paper about 1 month ago

Not only where, But when: Temporal Scheduling for RLVR

Paper • 2605.25381 • Published May 25 • 6

submitted a paper to Daily Papers about 1 month ago

Not only where, But when: Temporal Scheduling for RLVR

Paper • 2605.25381 • Published May 25 • 6

published a dataset about 1 month ago

JingHaoZ/OpenReasoning

Viewer • Updated May 30 • 30.2k • 54

updated a dataset about 1 month ago

JingHaoZ/OpenReasoning

Viewer • Updated May 30 • 30.2k • 54

upvoted a paper about 2 months ago

SkillOS: Learning Skill Curation for Self-Evolving Agents

Paper • 2605.06614 • Published May 7 • 47

upvoted 2 papers 3 months ago

Self-Distilled RLVR

Paper • 2604.03128 • Published Apr 3 • 179

FIPO: Eliciting Deep Reasoning with Future-KL Influenced Policy Optimization

Paper • 2603.19835 • Published Mar 20 • 353

upvoted a paper 5 months ago

UniReason 1.0: A Unified Reasoning Framework for World Knowledge Aligned Image Generation and Editing

Paper • 2602.02437 • Published Feb 2 • 80

updated a dataset 8 months ago

JingHaoZ/RLFR-Dataset-LM

Viewer • Updated Nov 14, 2025 • 102k • 221

upvoted 2 papers 8 months ago

TiDAR: Think in Diffusion, Talk in Autoregression

Paper • 2511.08923 • Published Nov 12, 2025 • 129

UniGenBench++: A Unified Semantic Evaluation Benchmark for Text-to-Image Generation

Paper • 2510.18701 • Published Oct 21, 2025 • 68

authored a paper 9 months ago

RLFR: Extending Reinforcement Learning for LLMs with Flow Environment

Paper • 2510.10201 • Published Oct 11, 2025 • 36

updated a collection 9 months ago

RLFR

Collection

Extending Reinforcement Learning for LLMs with Flow Environment • 5 items • Updated Oct 14, 2025 • 3

upvoted a paper 9 months ago

RLFR: Extending Reinforcement Learning for LLMs with Flow Environment

Paper • 2510.10201 • Published Oct 11, 2025 • 36

commented a paper 9 months ago

RLFR: Extending Reinforcement Learning for LLMs with Flow Environment

Paper • 2510.10201 • Published Oct 11, 2025 • 36 •

SII-Jhao Zhang

AI & ML interests

Recent Activity

Organizations

JingHaoZ's activity

Keep the Tokens Flowing: Lessons from 16 Open-Source RL Libraries