OlmoLogic • Collection • Boosting Reasoning via RLVR with Inductive Logic Programming • 5 items • Updated 3 days ago • 4
KletterMix: Climbing Toward High-Quality German Pretraining Data • Paper • 2606.03773 • Published 27 days ago • 21
Judging Quality Across Languages: A Multilingual Approach to Pretraining Data Filtering with Language Models • Paper • 2505.22232 • Published May 28, 2025 • 18