makeMoE: Implement a Sparse Mixture of Experts Language Model from Scratch • AviSoori1x • May 7, 2024
Welcome Llama 3 - Meta's new open LLM • philschmid, osanseviero, pcuenq, ybelkada, lvwerra • Apr 18, 2024