Nguyen Dang

lucadang

2

·

AI & ML interests

None yet

Organizations

upvoted an article about 1 year ago

Article

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

NormalUhr

•

Feb 7, 2025

• 297

upvoted an article over 1 year ago

Article

makeMoE: Implement a Sparse Mixture of Experts Language Model from Scratch

AviSoori1x

•

May 7, 2024

• 124