KletterMix: Climbing Toward High-Quality German Pretraining Data • Paper • 2606.03773 • Published 25 days ago • 21
Enhancing Temporal Understanding in Video-LLMs through Stacked Temporal Attention in Vision Encoders • Paper • 2510.26027 • Published Oct 29, 2025
Judging Quality Across Languages: A Multilingual Approach to Pretraining Data Filtering with Language Models • Paper • 2505.22232 • Published May 28, 2025 • 18
Investigating Multilingual Instruction-Tuning: Do Polyglot Models Demand for Multilingual Instructions? • Paper • 2402.13703 • Published Feb 21, 2024
Tokenizer Choice For LLM Training: Negligible or Crucial? • Paper • 2310.08754 • Published Oct 12, 2023 • 3
Towards Cross-Lingual LLM Evaluation for European Languages • Paper • 2410.08928 • Published Oct 11, 2024 • 2
Do Multilingual Large Language Models Mitigate Stereotype Bias? • Paper • 2407.05740 • Published Jul 8, 2024