view article Article There is no such thing as a tokenizer-free lunch catherinearnett • Sep 25, 2025 • 100
view article Article An Analysis of Multilingual Models on Hugging Face catherinearnett • Sep 18, 2025 • 6
view article Article Best Practices for Open Multilingual LLM Evaluation catherinearnett • May 7, 2025 • 8
view article Article Releasing the largest multilingual open pretraining dataset Pclanglais • Nov 13, 2024 • 108