Commit History

Fix dataset composition percentages and token counts
ea63110
verified

codelion committed on

Fix dataset mix to 50-30-20 and update code snippet with better generation params
5045572
verified

codelion committed on

Fix citation to reference blog post
7ba6110
verified

codelion committed on

Update README.md
39e5c69
verified

codelion committed on

Update README.md
162978d
verified

codelion committed on

Update README.md
dd65546
verified

codelion committed on

Add comprehensive model card with benchmark results
a61f958
verified

codelion committed on

Upload GPT-2 70M model trained with 40-30-30 dataset mixing (1B tokens)
9fac7e5
verified

codelion committed on

initial commit
7e025de
verified

codelion committed on