Untied Ulysses: Memory-Efficient Context Parallelism via Headwise Chunking Paper • 2602.21196 • Published 16 days ago • 5
GenAI Content Detection Task 1: English and Multilingual Machine-Generated Text Detection: AI vs. Human Paper • 2501.11012 • Published Jan 19, 2025
NorEval: A Norwegian Language Understanding and Generation Evaluation Benchmark Paper • 2504.07749 • Published Apr 10, 2025 • 1
Small Languages, Big Models: A Study of Continual Training on Languages of Norway Paper • 2412.06484 • Published Dec 9, 2024
Benchmarking Abstractive Summarisation: A Dataset of Human-authored Summaries of Norwegian News Articles Paper • 2501.07718 • Published Jan 13, 2025
A Collection of Question Answering Datasets for Norwegian Paper • 2501.11128 • Published Jan 19, 2025
Beemo: Benchmark of Expert-edited Machine-generated Outputs Paper • 2411.04032 • Published Nov 6, 2024 • 1
An Expanded Massive Multilingual Dataset for High-Performance Language Technologies Paper • 2503.10267 • Published Mar 13, 2025 • 2
A Family of Pretrained Transformer Language Models for Russian Paper • 2309.10931 • Published Sep 19, 2023 • 6
RussianSuperGLUE: A Russian Language Understanding Evaluation Benchmark Paper • 2010.15925 • Published Oct 29, 2020
Russian SuperGLUE 1.1: Revising the Lessons not Learned by Russian NLP models Paper • 2202.07791 • Published Feb 15, 2022
Findings of the The RuATD Shared Task 2022 on Artificial Text Detection in Russian Paper • 2206.01583 • Published Jun 3, 2022 • 1
Vote'n'Rank: Revision of Benchmarking with Social Choice Theory Paper • 2210.05769 • Published Oct 11, 2022
MLGym: A New Framework and Benchmark for Advancing AI Research Agents Paper • 2502.14499 • Published Feb 20, 2025 • 194
Towards Best Practices for Open Datasets for LLM Training Paper • 2501.08365 • Published Jan 14, 2025 • 62
The Impact of Copyrighted Material on Large Language Models: A Norwegian Perspective Paper • 2412.09460 • Published Dec 12, 2024 • 9