Article • Illustrating Reinforcement Learning from Human Feedback (RLHF) • natolambert, LouisCastricato, lvwerra, Dahoas • Dec 9, 2022 • 411
Article • seemore: Implement a Vision Language Model from Scratch • AviSoori1x • Jun 23, 2024 • 108
WAFFLE: Multi-Modal Model for Automated Front-End Development • Paper • 2410.18362 • Published Oct 24, 2024 • 13
Article • Performance Comparison: Llama-3.2 vs. Llama-3.1 LLMs and Smaller Models (3B, 1B) in Medical and Healthcare AI Domains 🩺🧬 • aaditya • Sep 26, 2024 • 7
Article • Fine-tuning LLMs to 1.58bit: extreme quantization made easy • medmekk, marcsun13, lvwerra, pcuenq, osanseviero, thomwolf • Sep 18, 2024 • 280