view article Article Illustrating Reinforcement Learning from Human Feedback (RLHF) +2 natolambert, LouisCastricato, lvwerra, Dahoas • Dec 9, 2022 • 411
Gemma release Collection Groups the Gemma models released by the Google team. • 40 items • Updated Mar 12 • 354
SeaLLMs -- Large Language Models for Southeast Asia Paper • 2312.00738 • Published Dec 1, 2023 • 25