Commit History

Final Rescue: Full vocab restoration with Unigram scores and Metaspace fix
a76cdb7
verified

suchirsalhan committed on

Final Fix: Correct Metaspace mapping and Unigram scores
90447ca
verified

suchirsalhan committed on

Fix: Final Metaspace decoding using exact vocab marker
cdd44c2
verified

suchirsalhan committed on

Fix: Robust SPM-to-HF conversion with merges and byte-fallback
141ef67
verified

suchirsalhan committed on

Fix: Final Metaspace decoding using exact vocab marker
422a108
verified

suchirsalhan committed on

Fix: Reverted to native SentencePiece handling (removed ByteLevel mismatch)
22f93a2
verified

suchirsalhan committed on

Fix: Applied ByteLevel pre-tokenization and decoding for proper spacing/hex handling
cb9966c
verified

suchirsalhan committed on

Fix: Clean base tokenizer using universal metaspace decoding
14f9844
verified

suchirsalhan committed on

Fix: Universal Metaspace decoding without keyword conflicts
08df509
verified

suchirsalhan committed on

Fix: Manual vocab extraction from spm.model pieces
61176b3
verified

suchirsalhan committed on

Fix: Environment issues resolved, full Fast Tokenizer uploaded.
9605799
verified

suchirsalhan committed on

Upload folder using huggingface_hub
a31abe2
verified

suchirsalhan committed on

Upload folder using huggingface_hub
77defca
verified

suchirsalhan committed on

Upload folder using huggingface_hub
13eef43
verified

suchirsalhan committed on

Upload folder using huggingface_hub
7982687
verified

suchirsalhan committed on

Upload folder using huggingface_hub
b6920df
verified

suchirsalhan committed on

Upload folder using huggingface_hub
3cd6d0c
verified

suchirsalhan committed on

Upload folder using huggingface_hub
6245e82
verified

suchirsalhan committed on

Upload folder using huggingface_hub
f77d8aa
verified

suchirsalhan committed on

Upload folder using huggingface_hub
903c894
verified

suchirsalhan committed on

initial commit
cec4a0f
verified

suchirsalhan committed on