Teuken-7B-v0.6 (Collection): OpenGPT-X Teuken 7B models trained on 6 trillion tokens. • 2 items • Updated Jul 28, 2025
Judging Quality Across Languages: A Multilingual Approach to Pretraining Data Filtering with Language Models (Paper) • 2505.22232 • Published May 28, 2025