Article • DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge • NormalUhr • Feb 7, 2025 • 295
Article • A Guide to Reinforcement Learning Post-Training for LLMs: PPO, DPO, GRPO, and Beyond • karina-zadorozhny • Jan 19 • 33
NbAiLab/wav2vec2-large-danish-npsc-nst Automatic Speech Recognition • 0.3B • Updated Jan 6, 2025 • 5.44k • 2
language-and-voice-lab/wav2vec2-large-xlsr-53-icelandic-ep30-967h Automatic Speech Recognition • Updated Apr 25, 2025 • 411k • 3
language-and-voice-lab/whisper-large-icelandic-62640-steps-967h Automatic Speech Recognition • Updated Apr 25, 2025 • 278 • 4
NbAiLab/nb-whisper-large-distil-turbo-beta Automatic Speech Recognition • 0.8B • Updated Sep 10, 2025 • 865 • 11
Space • The Smol Training Playbook 📚 • The secrets to building world-class LLMs • 3.22k