AI & ML interests
Ultra-clean datasets, optimised multilingual tokenizers, and open data infrastructure for language models. 71 languages, 26 script families. quartz.host
No public activity
Ultra-clean datasets, optimised multilingual tokenizers, and open data infrastructure for language models. 71 languages, 26 script families. quartz.host
No public activity