DatarrX/myX-Tokenizer
Feature Extraction • Updated
A comprehensive collection of syllable-aware tokenizers optimized for Burmese-English NLP tasks, developed by DatarrX.
Note Highly recommended for LLM training
Note Official training data for the myX series