zhu

xuekai

AI & ML interests

None yet

Recent Activity

upvoted a paper about 1 month ago

Post-Trained MoE Can Skip Half Experts via Self-Distillation

upvoted a paper 2 months ago

Rethinking On-Policy Distillation of Large Language Models: Phenomenology, Mechanism, and Recipe

upvoted a paper 5 months ago

Statistical Estimation of Adversarial Risk in Large Language Models under Best-of-N Sampling

commented a paper 8 months ago

FlowRL: Matching Reward Distributions for LLM Reasoning

Paper • 2509.15207 • Published Sep 18, 2025 • 119

commented 2 papers 9 months ago

FlowRL: Matching Reward Distributions for LLM Reasoning

Paper • 2509.15207 • Published Sep 18, 2025 • 119

commented a paper over 1 year ago

How to Synthesize Text Data without Model Collapse?

Paper • 2412.14689 • Published Dec 19, 2024 • 53

New activity in allenai/dolma over 2 years ago

JSON ERROR in loading files of v1_6-sample using load_dataset

#22 opened over 2 years ago by