view article Article Fine-tune Llama 3.1 Ultra-Efficiently with Unsloth mlabonne • Jul 29, 2024 • 373
view article Article DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge NormalUhr • Feb 7, 2025 • 295
view article Article Comprehensive Guide to Reinforcement Learning in Modern AI ProCreations • Jun 9, 2025 • 2
view article Article Introducing AutoRound: Intel’s Advanced Quantization for LLMs and VLMs +7 wenhuach, Haihao, weiweiz1, n1ck-guo, isaacmac, kding1, IlyasMoutawwakil, marcsun13, medmekk • Apr 29, 2025 • 45