Final Rescue: Full vocab restoration with Unigram scores and Metaspace fix baaff89 verified suchirsalhan commited on 5 days ago
Final Fix: Correct Metaspace mapping and Unigram scores b868b13 verified suchirsalhan commited on 5 days ago
Fix: Final Metaspace decoding using exact vocab marker 5c33a2d verified suchirsalhan commited on 5 days ago
Fix: Robust SPM-to-HF conversion with merges and byte-fallback 4dfa217 verified suchirsalhan commited on 5 days ago
Fix: Final Metaspace decoding using exact vocab marker 3849f9a verified suchirsalhan commited on 5 days ago
Fix: Used Llama-style identity wrapping to preserve SPM IDs and fix spacing 9596a96 verified suchirsalhan commited on 5 days ago
Fix: Final Metaspace decoding using exact vocab marker 423415a verified suchirsalhan commited on 5 days ago
Fix: Reverted to native SentencePiece handling (removed ByteLevel mismatch) 8c3acff verified suchirsalhan commited on 5 days ago
Fix: Applied ByteLevel pre-tokenization and decoding for proper spacing/hex handling a50f618 verified suchirsalhan commited on 5 days ago
Fix: Clean base tokenizer using universal metaspace decoding 3a96c78 verified suchirsalhan commited on 5 days ago
Fix: Universal Metaspace decoding without keyword conflicts 4c12eb7 verified suchirsalhan commited on 5 days ago
Fix: Manual vocab extraction from spm.model pieces 4e7922c verified suchirsalhan commited on 5 days ago
Fix: Environment issues resolved, full Fast Tokenizer uploaded. c40c4b9 verified suchirsalhan commited on 5 days ago