Hanxu Hu PRO

HanxuHU

https://hanxuhu.github.io/

AI & ML interests

LLM, NLP

Recent Activity

authored a paper 20 days ago

DeReason: A Difficulty-Aware Curriculum Improves Decoupled SFT-then-RL Training for General Reasoning

authored a paper 20 days ago

Reinforcement Learning Elicits Contextual Learning of Unseen Language Translation

upvoted a paper 21 days ago

Reinforcement Learning Elicits Contextual Learning of Unseen Language Translation

authored 2 papers 20 days ago

DeReason: A Difficulty-Aware Curriculum Improves Decoupled SFT-then-RL Training for General Reasoning

Paper • 2603.11193 • Published Mar 11

Reinforcement Learning Elicits Contextual Learning of Unseen Language Translation

Paper • 2606.06428 • Published 22 days ago • 25

upvoted a paper 21 days ago

Reinforcement Learning Elicits Contextual Learning of Unseen Language Translation

Paper • 2606.06428 • Published 22 days ago • 25

updated a dataset 21 days ago

HanxuHU/rl-new-language

Viewer • Updated 21 days ago • 135k • 805

updated a collection 21 days ago

RL-unseen-language

Collection

Using RL to elicit LLMs' ability to leverage context when learning unseen languages! • 2 items • Updated 21 days ago

submitted a paper to Daily Papers 21 days ago

Reinforcement Learning Elicits Contextual Learning of Unseen Language Translation

Paper • 2606.06428 • Published 22 days ago • 25

published a dataset 23 days ago

HanxuHU/rl-new-language

Viewer • Updated 21 days ago • 135k • 805

authored a paper 4 months ago

CodeScaler: Scaling Code LLM Training and Test-Time Inference via Execution-Free Reward Models

Paper • 2602.17684 • Published Feb 4 • 22

upvoted a paper 4 months ago

CodeScaler: Scaling Code LLM Training and Test-Time Inference via Execution-Free Reward Models

Paper • 2602.17684 • Published Feb 4 • 22

upvoted a paper 8 months ago

JanusCoder: Towards a Foundational Visual-Programmatic Interface for Code Intelligence

Paper • 2510.23538 • Published Oct 27, 2025 • 99

published a dataset 8 months ago

HanxuHU/usaco_v2

Viewer • Updated Oct 11, 2025 • 294 • 12

updated a dataset 8 months ago

HanxuHU/ocr_data_question_28k_Qwen3-8B

Viewer • Updated Oct 28, 2025 • 28k • 17

published a dataset 8 months ago

HanxuHU/ocr_data_question_28k_Qwen3-8B

Viewer • Updated Oct 28, 2025 • 28k • 17
