SindhiLM-Tokenizer-v1 / tokenizer.json
aakashMeghwar01's picture
Upgraded architecture: Unigram + Metaspace fix + 32k Vocab limit + Morpheme splitting
be8d37f verified
This file is stored with Xet . It is too big to display, but you can still download it.

Large File Pointer Details

( Raw pointer file )
SHA256:
cdc6e4b6e6c5771ecea76bbb66cc9fb3f4c6e47a4c35f3e8cc327b6733478e1f
Pointer size:
129 Bytes
·
Size of remote file:
4.22 kB
·
Xet hash:
aa03b250893bc30cc6f41db01953bd7edba2bf9d68eaee62b257fa118d8f08f1

Xet efficiently stores Large Files inside Git, intelligently splitting files into unique chunks and accelerating uploads and downloads. More info.