Bingxiang He's picture

Bingxiang He

hbx

·

https://hbx-hbx.github.io/

AI & ML interests

NLP

Recent Activity

upvoted a paper about 12 hours ago

NatureBench: Can Coding Agents Match the Published SOTA of Nature-Family Papers?

upvoted a paper about 12 hours ago

Qwen-AgentWorld: Language World Models for General Agents

upvoted a paper 1 day ago

EnterpriseClawBench: Benchmarking Agents from Real Workplace Sessions

View all activity

Organizations

commented 2 papers 2 months ago

Rethinking On-Policy Distillation of Large Language Models: Phenomenology, Mechanism, and Recipe

Paper • 2604.13016 • Published Apr 14 • 113 •

Rethinking On-Policy Distillation of Large Language Models: Phenomenology, Mechanism, and Recipe

Paper • 2604.13016 • Published Apr 14 • 113 •

commented a paper 4 months ago

How Far Can Unsupervised RLVR Scale LLM Training?

Paper • 2603.08660 • Published Mar 9 • 60 •

New activity in hbx/JustRL-Nemotron-1.5B 6 months ago

Add Hugging Face paper link badge to model card

#1 opened 6 months ago by

New activity in hbx/JustRL-DeepSeek-1.5B 6 months ago

Improve model card: Update title, add paper link, correct license and citation

#1 opened 6 months ago by

commented a paper 6 months ago

JustRL: Scaling a 1.5B LLM with a Simple RL Recipe

Paper • 2512.16649 • Published Dec 18, 2025 • 31 •