Fix dataset composition percentages and token counts ea63110 verified codelion commited on Nov 2, 2025
Fix dataset mix to 50-30-20 and update code snippet with better generation params 5045572 verified codelion commited on Nov 2, 2025
Add comprehensive model card with benchmark results a61f958 verified codelion commited on Nov 1, 2025
Upload GPT-2 70M model trained with 40-30-30 dataset mixing (1B tokens) 9fac7e5 verified codelion commited on Nov 1, 2025