·
AI & ML interests
LLM pretraining from scratch, single-GPU training, bilingual EN/IT models, Supervised finetuning SFT, Reinforcement learning GRPO/GSPO
Recent Activity
Organizations
None yet
view article Open-R1: a fully open reproduction of DeepSeek-R1


- +1
eliebak, lvwerra, lewtun
• • 889