Geyang's picture

Geyang

geyang627

·

AI & ML interests

None yet

Recent Activity

updated a dataset about 1 month ago

geyang627/care_pro

published a dataset about 1 month ago

geyang627/care_pro

upvoted a paper 3 months ago

Safe and Scalable Web Agent Learning via Recreated Websites

View all activity

Organizations

upvoted a paper 3 months ago

Safe and Scalable Web Agent Learning via Recreated Websites

Paper • 2603.10505 • Published Mar 11 • 27

upvoted 2 articles 4 months ago

Article

Deriving the PPO Loss from First Principles

garg-aayush

•

Dec 25, 2025

• 45

Article

A Guide to Reinforcement Learning Post-Training for LLMs: PPO, DPO, GRPO, and Beyond

karina-zadorozhny

•

Jan 19

• 31

upvoted a collection 11 months ago

Qwen3

84 items • Updated Dec 31, 2025 • 1.82k

upvoted a collection about 1 year ago

CARE

14 items • Updated Jun 30, 2025 • 2

upvoted a paper about 2 years ago

Beyond Imitation: Leveraging Fine-grained Quality Signals for Alignment

Paper • 2311.04072 • Published Nov 7, 2023 • 1