AI & ML interests
Ultra-clean datasets, optimised multilingual tokenizers, and open data infrastructure for language models. 71 languages, 26 script families. quartz.host
models 0
None public yet
datasets 0
None public yet
Ultra-clean datasets, optimised multilingual tokenizers, and open data infrastructure for language models. 71 languages, 26 script families. quartz.host