·
AI & ML interests
None yet
Organizations
upvoted an article about 1 year ago view article DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge
NormalUhr
• • 293
view article Open-R1: a fully open reproduction of DeepSeek-R1


- +1
eliebak, lvwerra, lewtun
• • 889
upvoted a paper almost 2 years ago upvoted an article about 2 years ago view article LLM Comparison/Test: Llama 3 Instruct 70B + 8B HF/GGUF/EXL2 (20 versions tested and compared!)
wolfram
• • 63