Repetition over Diversity: High-Signal Data Filtering for Sample-Efficient German Language Modeling Paper • 2604.28075 • Published 6 days ago • 14
Repetition over Diversity: High-Signal Data Filtering for Sample-Efficient German Language Modeling Paper • 2604.28075 • Published 6 days ago • 14
FiNERweb: Datasets and Artifacts for Scalable Multilingual Named Entity Recognition Paper • 2512.13884 • Published Dec 15, 2025 • 15
Llama-2 Adapters Collection Template: "### r/{subreddit} Question:\n\n{instruction}\n\n### r/{subreddit} Answer:\n\n" • 11 items • Updated Dec 15, 2025