Huo

Yupeng123

hyyp1

AI & ML interests

AI NLP

Organizations

None yet

updated a model 1 day ago

Yupeng123/AtomMem-8B

8B • Updated 1 day ago • 11 • 1

upvoted a paper 7 days ago

DARC: Decoupled Asymmetric Reasoning Curriculum for LLM Evolution

Paper • 2601.13761 • Published 8 days ago • 15

liked a model 8 days ago

Yupeng123/AtomMem-8B

8B • Updated 1 day ago • 11 • 1

published a model 9 days ago

Yupeng123/AtomMem-8B

8B • Updated 1 day ago • 11 • 1

liked a model 15 days ago

openbmb/AgentCPM-Explore

Text Generation • 4B • Updated 10 days ago • 3.36k • 400

upvoted a paper 3 months ago

LaSeR: Reinforcement Learning with Last-Token Self-Rewarding

Paper • 2510.14943 • Published Oct 16, 2025 • 40

authored 2 papers 7 months ago

GUICourse: From General Vision Language Models to Versatile GUI Agents

Paper • 2406.11317 • Published Jun 17, 2024 • 1

AgentCPM-GUI: Building Mobile-Use Agents with Reinforcement Fine-Tuning

Paper • 2506.01391 • Published Jun 2, 2025

upvoted 2 papers 7 months ago

Evolving Prompts In-Context: An Open-ended, Self-replicating Perspective

Paper • 2506.17930 • Published Jun 22, 2025 • 18

ReDit: Reward Dithering for Improved LLM Policy Optimization

Paper • 2506.18631 • Published Jun 23, 2025 • 7