zhang

chaopeng

AI & ML interests

None yet

Recent Activity

liked a model about 2 months ago

zai-org/GLM-5.2

liked a Space 9 months ago

HuggingFaceTB/smol-training-playbook

upvoted an article about 1 year ago

Navigating the RLHF Landscape: From Policy Gradients to PPO, GAE, and DPO for LLM Alignment

Organizations

None yet

upvoted an article about 1 year ago

Article

Navigating the RLHF Landscape: From Policy Gradients to PPO, GAE, and DPO for LLM Alignment

NormalUhr • Feb 11, 2025 • 130

upvoted an article over 1 year ago

Article

Fast, High-Fidelity LLM Decoding with Regex Constraints

vivien • Feb 23, 2024 • 12