RLFR - a JingHaoZ Collection

RLFR

Updated Oct 14, 2025

Extending Reinforcement Learning for LLMs with Flow Environment

JingHaoZ/RLFR-Qwen2.5-Math-7B

Text Generation • 8B • Updated Oct 14, 2025 • 5

JingHaoZ/RLFR-Qwen2.5-VL-7B-Instruct

Image-Text-to-Text • 8B • Updated Oct 14, 2025 • 5 • 1

JingHaoZ/RLFR-Dataset-LM

Viewer • Updated Nov 14, 2025 • 102k • 216

JingHaoZ/RLFR-Dataset-VLM

Preview • Updated Oct 14, 2025 • 44

RLFR: Extending Reinforcement Learning for LLMs with Flow Environment

Paper • 2510.10201 • Published Oct 11, 2025 • 36