tiny-dolma10M / README.md
metadata
license: apache-2.0
datasets:
  - ThomasTheMaker/pretokenized-dolma-10M
  - allenai/dolma
language:
  - en

An 11M-parameter model pre-trained on 10M rows of the Dolma dataset.