Bingxiang He

hbx
https://hbx-hbx.github.io/
  • hbx_hbx
  • HBX-hbx

AI & ML interests

NLP

Recent Activity

liked a model 3 days ago
lllyx/Qwen3-4B-Base-GRPO
liked a model 3 days ago
lllyx/Qwen3-1.7B-SFT
updated a collection 4 days ago
JustRL

Organizations

Tsinghua NLP Group, ML intern explorers

commented a paper 14 days ago

Rethinking On-Policy Distillation of Large Language Models: Phenomenology, Mechanism, and Recipe

Paper • 2604.13016 • Published 23 days ago • 90 •
6
commented a paper 22 days ago

Rethinking On-Policy Distillation of Large Language Models: Phenomenology, Mechanism, and Recipe

Paper • 2604.13016 • Published 23 days ago • 90 •
6
commented a paper about 2 months ago

How Far Can Unsupervised RLVR Scale LLM Training?

Paper • 2603.08660 • Published Mar 9 • 59 •
4
New activity in hbx/JustRL-Nemotron-1.5B 5 months ago

Add Hugging Face paper link badge to model card

#1 opened 5 months ago by nielsr
New activity in hbx/JustRL-DeepSeek-1.5B 5 months ago

Improve model card: Update title, add paper link, correct license and citation

#1 opened 5 months ago by nielsr
commented a paper 5 months ago

JustRL: Scaling a 1.5B LLM with a Simple RL Recipe

Paper • 2512.16649 • Published Dec 18, 2025 • 27 •
3