Kexin Huang's picture

Kexin Huang

737443h

·

https://kexinhuang02.github.io

AI & ML interests

None yet

Recent Activity

submitted a paper 6 days ago

FIPO: Eliciting Deep Reasoning with Future-KL Influenced Policy Optimization

authored a paper 8 days ago

FIPO: Eliciting Deep Reasoning with Future-KL Influenced Policy Optimization

authored a paper 8 days ago

On the Direction of RLVR Updates for LLM Reasoning: Identification and Exploitation

View all activity

Organizations

None yet

upvoted 2 papers 13 days ago

Sparse but Critical: A Token-Level Analysis of Distributional Shifts in RLVR Fine-Tuning of LLMs

Paper • 2603.22446 • Published 15 days ago • 8

On the Direction of RLVR Updates for LLM Reasoning: Identification and Exploitation

Paper • 2603.22117 • Published 15 days ago • 29

upvoted a paper 6 months ago

Quantile Advantage Estimation for Entropy-Safe Reasoning

Paper • 2509.22611 • Published Sep 26, 2025 • 120

upvoted a paper 10 months ago

Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learning for LLM Reasoning

Paper • 2506.01939 • Published Jun 2, 2025 • 190

upvoted an article 12 months ago

Article

Visualize and understand GPU memory in PyTorch

Dec 24, 2024

•

269

upvoted a paper about 1 year ago

START: Self-taught Reasoner with Tools

Paper • 2503.04625 • Published Mar 6, 2025 • 113

upvoted a paper over 1 year ago

Qwen2.5 Technical Report

Paper • 2412.15115 • Published Dec 19, 2024 • 377