Add curated training tokens (266M tokens, Chinchilla-optimal) c725486 verified LisaMegaWatts committed on Feb 27