Commit History

Final Rescue: Full vocab restoration with Unigram scores and Metaspace fix
0d52c92
verified

suchirsalhan committed on

Final Fix: Correct Metaspace mapping and Unigram scores
79f5510
verified

suchirsalhan committed on

Fix: Final Metaspace decoding using exact vocab marker
dd7139b
verified

suchirsalhan committed on

Fix: Robust SPM-to-HF conversion with merges and byte-fallback
9fb11b8
verified

suchirsalhan committed on

Fix: Final Metaspace decoding using exact vocab marker
dacd2f0
verified

suchirsalhan committed on

Fix: Used Llama-style identity wrapping to preserve SPM IDs and fix spacing
bb8d4b5
verified

suchirsalhan committed on

Fix: Final Metaspace decoding using exact vocab marker
ff01214
verified

suchirsalhan committed on

Fix: Reverted to native SentencePiece handling (removed ByteLevel mismatch)
6c75610
verified

suchirsalhan committed on

Fix: Applied ByteLevel pre-tokenization and decoding for proper spacing/hex handling
837e60a
verified

suchirsalhan committed on

Fix: Clean base tokenizer using universal metaspace decoding
15e1bd6
verified

suchirsalhan committed on

Fix: Universal Metaspace decoding without keyword conflicts
a29d3e1
verified

suchirsalhan committed on

Fix: Manual vocab extraction from spm.model pieces
425507b
verified

suchirsalhan committed on

Fix: Environment issues resolved, full Fast Tokenizer uploaded.
3d13f8f
verified

suchirsalhan committed on

Upload folder using huggingface_hub
779ed03
verified

suchirsalhan committed on

Upload folder using huggingface_hub
8581359
verified

suchirsalhan committed on

Upload folder using huggingface_hub
677826a
verified

suchirsalhan committed on

Upload folder using huggingface_hub
42a0f89
verified

suchirsalhan committed on

Upload folder using huggingface_hub
7d5461f
verified

suchirsalhan committed on

Upload folder using huggingface_hub
1a6b4dc
verified

suchirsalhan committed on

Upload folder using huggingface_hub
2b3da18
verified

suchirsalhan committed on

Upload folder using huggingface_hub
690ae2a
verified

suchirsalhan committed on

Update tokenizer_config.json
92187e9
verified

suchirsalhan committed on

Update tokenizer_config.json
6259db5
verified

suchirsalhan committed on

Update tokenizer_config.json
dab49e5
verified

suchirsalhan committed on

Upload folder using huggingface_hub
fc0be57
verified

suchirsalhan committed on

Upload folder using huggingface_hub
cbba21a
verified

suchirsalhan committed on

initial commit
93c7a73
verified

suchirsalhan committed on