Final Rescue: Full vocab restoration with Unigram scores and Metaspace fix 4efabc8 verified suchirsalhan commited on 6 days ago
Final Fix: Correct Metaspace mapping and Unigram scores 01c823f verified suchirsalhan commited on 6 days ago
Fix: Final Metaspace decoding using exact vocab marker e899c37 verified suchirsalhan commited on 6 days ago
Fix: Robust SPM-to-HF conversion with merges and byte-fallback 75dbc72 verified suchirsalhan commited on 6 days ago
Fix: Final Metaspace decoding using exact vocab marker 5665244 verified suchirsalhan commited on 6 days ago
Fix: Used Llama-style identity wrapping to preserve SPM IDs and fix spacing ec89e42 verified suchirsalhan commited on 6 days ago
Fix: Final Metaspace decoding using exact vocab marker f500518 verified suchirsalhan commited on 6 days ago
Fix: Reverted to native SentencePiece handling (removed ByteLevel mismatch) 78bf40b verified suchirsalhan commited on 6 days ago
Fix: Applied ByteLevel pre-tokenization and decoding for proper spacing/hex handling ce9873a verified suchirsalhan commited on 6 days ago
Fix: Clean base tokenizer using universal metaspace decoding 3fd6229 verified suchirsalhan commited on 6 days ago
Fix: Universal Metaspace decoding without keyword conflicts 70dec91 verified suchirsalhan commited on 6 days ago
Fix: Manual vocab extraction from spm.model pieces 233ea46 verified suchirsalhan commited on 6 days ago
Fix: Environment issues resolved, full Fast Tokenizer uploaded. 5381407 verified suchirsalhan commited on 6 days ago