Commit History

Manifest - 23.1B tokens
2811c99
verified

akpsahan committed on

langchain_docs - 46,653,933 tokens (0.1GB)
f402cdd
verified

akpsahan committed on

coder_combined - 74,567,978 tokens (0.1GB)
0ed9ce2
verified

akpsahan committed on

sinhalen_gz - 2,000,000,000 tokens (3.7GB)
1a45d60
verified

akpsahan committed on

sinhala_cx - 8,000,000,000 tokens (14.9GB)
9370bfd
verified

akpsahan committed on

fineweb_edu - 5,000,000,000 tokens (9.3GB)
1284c8d
verified

akpsahan committed on

english_cx - 8,000,000,000 tokens (14.9GB)
5bc0aae
verified

akpsahan committed on

Upload tokenizer.json
85fbfff
verified

akpsahan committed on

Delete tokenizer.json
4bc63ce
verified

akpsahan committed on

Update tokenizer.json
79ded90
verified

akpsahan committed on

Update tokenizer.json
c0a80a0
verified

akpsahan committed on

Upload tokenizer.json
7c05565
verified

akpsahan committed on

initial commit
9aa7fab
verified

akpsahan committed on