SII-Jhao Zhang

JingHaoZ

2 19 3

https://jinghaoleven.github.io

AI & ML interests

Large Reasoning Model, Unified Understanding and Generation in MLLM

Recent Activity

upvoted an article about 6 hours ago

Keep the Tokens Flowing: Lessons from 16 Open-Source RL Libraries

upvoted a paper 17 days ago

JoyAI-VL-Interaction: Real-Time Vision-Language Interaction Intelligence

updated a collection 26 days ago

RLVR-Schedule

View all activity

Organizations

upvoted an article about 6 hours ago

Article

Keep the Tokens Flowing: Lessons from 16 Open-Source RL Libraries

aminediroHF, qgallouedec, kashif, lewtun, edbeeching, albertvillanova, nouamanetazi, lvwerra, sergiopaniego

•

Mar 10

• 167

upvoted a paper 17 days ago

JoyAI-VL-Interaction: Real-Time Vision-Language Interaction Intelligence

Paper • 2606.14777 • Published 23 days ago • 208

upvoted a paper about 1 month ago

Not only where, But when: Temporal Scheduling for RLVR

Paper • 2605.25381 • Published May 25 • 6

upvoted a paper about 2 months ago

SkillOS: Learning Skill Curation for Self-Evolving Agents

Paper • 2605.06614 • Published May 7 • 47

upvoted 2 papers 3 months ago

Self-Distilled RLVR

Paper • 2604.03128 • Published Apr 3 • 179

FIPO: Eliciting Deep Reasoning with Future-KL Influenced Policy Optimization

Paper • 2603.19835 • Published Mar 20 • 353

upvoted a paper 5 months ago

UniReason 1.0: A Unified Reasoning Framework for World Knowledge Aligned Image Generation and Editing

Paper • 2602.02437 • Published Feb 2 • 80

upvoted 2 papers 8 months ago

TiDAR: Think in Diffusion, Talk in Autoregression

Paper • 2511.08923 • Published Nov 12, 2025 • 129

UniGenBench++: A Unified Semantic Evaluation Benchmark for Text-to-Image Generation

Paper • 2510.18701 • Published Oct 21, 2025 • 68

upvoted 2 papers 9 months ago

RLFR: Extending Reinforcement Learning for LLMs with Flow Environment

Paper • 2510.10201 • Published Oct 11, 2025 • 36

Agentic Jigsaw Interaction Learning for Enhancing Visual Perception and Reasoning in Vision-Language Models

Paper • 2510.01304 • Published Oct 1, 2025 • 11

upvoted a paper 10 months ago

Pref-GRPO: Pairwise Preference Reward-based GRPO for Stable Text-to-Image Reinforcement Learning

Paper • 2508.20751 • Published Aug 28, 2025 • 90

upvoted a paper 11 months ago

SEAgent: Self-Evolving Computer Use Agent with Autonomous Learning from Experience

Paper • 2508.04700 • Published Aug 6, 2025 • 52

upvoted a collection about 1 year ago

LUFFY-RL

Collection

9 items • Updated May 30, 2025 • 10

upvoted 3 papers over 1 year ago

SimpleRL-Zoo: Investigating and Taming Zero Reinforcement Learning for Open Base Models in the Wild

Paper • 2503.18892 • Published Mar 24, 2025 • 31

DAPO: An Open-Source LLM Reinforcement Learning System at Scale

Paper • 2503.14476 • Published Mar 18, 2025 • 146

Unified Reward Model for Multimodal Understanding and Generation

Paper • 2503.05236 • Published Mar 7, 2025 • 124

upvoted a collection almost 2 years ago

LMMs-Eval

Collection

Dataset Collection of LMMs-Eval • 35 items • Updated Mar 2 • 33

upvoted a collection about 2 years ago

Awesome SFT datasets

Collection

A curated list of interesting datasets to fine-tune language models with. • 41 items • Updated Mar 2 • 154

SII-Jhao Zhang

AI & ML interests

Recent Activity

Organizations

JingHaoZ's activity

Keep the Tokens Flowing: Lessons from 16 Open-Source RL Libraries