William Arthor's picture

3 3

William Arthor

narrowsnap

·

narrowsnap

AI & ML interests

None yet

Organizations

None yet

upvoted an article 7 months ago

Article

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

NormalUhr

•

Feb 7, 2025

• 293

upvoted a collection almost 2 years ago

Llama 3.1

This collection hosts the transformers and original repos of the Llama 3.1, Llama Guard 3 and Prompt Guard models • 11 items • Updated Dec 6, 2024 • 711