6 15 186

Jian Hu

chuyi777

https://hujian.website

hijkzzz

AI & ML interests

Reinforcement Learning

Recent Activity

updated a model about 2 months ago

OpenRLHF/Llama-3-8b-rm-700k

upvoted a paper 3 months ago

PhyCritic: Multimodal Critic Models for Physical AI

updated a dataset 3 months ago

OpenRLHF/aime-2024

View all activity

Organizations

updated a model about 2 months ago

OpenRLHF/Llama-3-8b-rm-700k

Text Ranking • 8B • Updated Mar 16 • 1.18k • 3

upvoted a paper 3 months ago

PhyCritic: Multimodal Critic Models for Physical AI

Paper • 2602.11124 • Published Feb 11 • 55

updated 2 datasets 3 months ago

OpenRLHF/aime-2024

Viewer • Updated Feb 6 • 30 • 692

OpenRLHF/dapo-math-17k

Viewer • Updated Feb 6 • 17.4k • 120

authored a paper 3 months ago

Golden Goose: A Simple Trick to Synthesize Unlimited RLVR Tasks from Unverifiable Internet Text

Paper • 2601.22975 • Published Jan 30 • 111

published 2 datasets 3 months ago

OpenRLHF/aime-2024

Viewer • Updated Feb 6 • 30 • 692

OpenRLHF/dapo-math-17k

Viewer • Updated Feb 6 • 17.4k • 120

upvoted a paper 3 months ago

Golden Goose: A Simple Trick to Synthesize Unlimited RLVR Tasks from Unverifiable Internet Text

Paper • 2601.22975 • Published Jan 30 • 111

upvoted 2 papers 7 months ago

DLER: Doing Length pEnalty Right - Incentivizing More Intelligence per Token via Reinforcement Learning

Paper • 2510.15110 • Published Oct 16, 2025 • 18

BroRL: Scaling Reinforcement Learning via Broadened Exploration

Paper • 2510.01180 • Published Oct 1, 2025 • 20

liked 2 models 8 months ago

moonshotai/Kimi-K2-Instruct-0905

Text Generation • 1T • Updated Jan 30 • 768k • • 708

nvidia/NVIDIA-Nemotron-Nano-12B-v2

Text Generation • Updated Nov 25, 2025 • 12.8k • • 163

updated a dataset 8 months ago

OpenRLHF/gem_guess_game

Viewer • Updated Aug 30, 2025 • 2.05k • 8 • 1

published a dataset 8 months ago

OpenRLHF/gem_guess_game

Viewer • Updated Aug 30, 2025 • 2.05k • 8 • 1

New activity in nvidia/NVIDIA-Nemotron-Nano-9B-v2 8 months ago

some problem when I asked the model: 你是谁？

🤯 2

#8 opened 9 months ago by

wenzel94

upvoted a paper 9 months ago

Part I: Tricks or Traps? A Deep Dive into RL for LLM Reasoning

Paper • 2508.08221 • Published Aug 11, 2025 • 50

liked 2 models 9 months ago

openai/gpt-oss-20b

Text Generation • 22B • Updated Aug 26, 2025 • 7.23M • • 4.59k

mistralai/Devstral-Small-2505

24B • Updated Aug 18, 2025 • 51.1k • 869

liked a dataset 9 months ago

MegaScience/MegaScience

Viewer • Updated Jul 24, 2025 • 1.25M • 18.5k • 130

liked a dataset 10 months ago

newfacade/LeetCodeDataset

Viewer • Updated May 29, 2025 • 2.87k • 2.37k • 64

Jian Hu

AI & ML interests

Recent Activity

Organizations

chuyi777's activity

some problem when I asked the model: 你是谁？