Buckets:

48.6 GB
67 files
Updated 25 days ago
Name
Size
tokenized
raw
val.bin8.24 MB
xet
train.bin31.1 GB
xet
tokenizer.json2.46 MB
xet
state.json19.8 kB
xet
meta.json326 Bytes
xet
Total size
48.6 GB
Files
67
Last updated
May 31
Pre-warmed CDN
US EU US EU

Contributors