Pre-Training Curriculum for Multi-Token Prediction in Language Models • Paper • 2505.22757 • Published May 28, 2025
Repetition over Diversity: High-Signal Data Filtering for Sample-Efficient German Language Modeling • Paper • 2604.28075 • Published 7 days ago
OpinionGPT: Modelling Explicit Biases in Instruction-Tuned LLMs • Paper • 2309.03876 • Published Sep 7, 2023
SemScore: Automated Evaluation of Instruction-Tuned LLMs based on Semantic Textual Similarity • Paper • 2401.17072 • Published Jan 30, 2024