lama3-sindhi-tokenizer / tokenizer.json
Kashif786
Overwriting with refined 20k Unigram Sindhi vocabulary for thesis research
bd40019 verified
This file is stored with Xet. It is too big to display, but you can still download it.

Large File Pointer Details

SHA256: a01e422490f744c2231e0eb1f1b05ba6b4bdae763684165916eefbda128dea74
Pointer size: 133 Bytes
Size of remote file: 20.5 MB
Xet hash: d674cf7331f4c4b20a3ecc3d37cf8c4758f1c46f45cfd7b286f98ec4b8080609
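
Since the file cannot be previewed, one way to check a downloaded copy is to compare its SHA-256 digest with the value listed in the pointer details. The sketch below uses only the Python standard library. It assumes that the listed SHA256 is the digest of the full remote file (as with Git LFS-style pointers) and that the download was saved as `tokenizer.json` in the working directory; the path and the function names are illustrative, not part of the repository.

```python
import hashlib
from pathlib import Path

# Digest copied from the pointer details; assumed to cover the full remote file.
EXPECTED_SHA256 = "a01e422490f744c2231e0eb1f1b05ba6b4bdae763684165916eefbda128dea74"


def sha256_of_file(path, chunk_size=1 << 20):
    """Return the hex SHA-256 digest of a file, reading it in fixed-size blocks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        # Read until handle.read() returns an empty bytes object (end of file).
        for block in iter(lambda: handle.read(chunk_size), b""):
            digest.update(block)
    return digest.hexdigest()


def matches_pointer(path, expected=EXPECTED_SHA256):
    """True if the local file's digest equals the expected digest (case-insensitive)."""
    return sha256_of_file(path) == expected.lower()


if __name__ == "__main__":
    local_copy = Path("tokenizer.json")  # hypothetical local download path
    if local_copy.exists():
        print("match" if matches_pointer(local_copy) else "MISMATCH")
    else:
        print(f"{local_copy} not found; download it first")
```

Reading in blocks keeps memory use flat, which matters little for a 20.5 MB file but costs nothing. A mismatch would suggest an incomplete download or a different revision of the file.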

Xet stores large files inside Git efficiently, splitting them into unique chunks to speed up uploads and downloads.