Fix NameError: make dataset module self-contained with its own CharTokenizer\n\nDuplicates the minimal CharTokenizer in the dataset module to avoid import dependency issues." 9d170ee verified krystv commited on 3 days ago