Initial commit of 100M training tokens on gpt2-small, pythia-160m-deduped, opt-125m f7a232d verified evanhanders commited on Jun 7, 2024