·
AI & ML interests
Data Science, ML
Organizations
-
-
-
-
-
-
-
-
-
-
-
view article Fine-tuning SmolLM with Group Relative Policy Optimization (GRPO) by following the Methodologies
upvoted an article about 1 year ago view article Open-R1: a fully open reproduction of DeepSeek-R1
- +1
upvoted a paper over 1 year ago