Huo's picture

4 2

Huo

Yupeng123

hyyp1

AI & ML interests

AI NLP

Recent Activity

updated a model 1 day ago

Yupeng123/AtomMem-8B

upvoted a paper 7 days ago

DARC: Decoupled Asymmetric Reasoning Curriculum for LLM Evolution

liked a model 9 days ago

Yupeng123/AtomMem-8B

View all activity

Organizations

None yet

upvoted a paper 7 days ago

DARC: Decoupled Asymmetric Reasoning Curriculum for LLM Evolution

Paper • 2601.13761 • Published 9 days ago • 15

upvoted a paper 3 months ago

LaSeR: Reinforcement Learning with Last-Token Self-Rewarding

Paper • 2510.14943 • Published Oct 16, 2025 • 40

upvoted 2 papers 7 months ago

Evolving Prompts In-Context: An Open-ended, Self-replicating Perspective

Paper • 2506.17930 • Published Jun 22, 2025 • 18

ReDit: Reward Dithering for Improved LLM Policy Optimization

Paper • 2506.18631 • Published Jun 23, 2025 • 7