NUBES: A Corpus of Negation and Uncertainty in Spanish Clinical Texts Paper • 2004.01092 • Published Apr 2, 2020
Instructing Large Language Models for Low-Resource Languages: A Systematic Study for Basque Paper • 2506.07597 • Published Jun 9, 2025
Summarization Metrics for Spanish and Basque: Do Automatic Scores and LLM-Judges Correlate with Humans? Paper • 2503.17039 • Published Mar 21, 2025
Gender Bias in MT for a Genderless Language: New Benchmarks for Basque Paper • 2603.08153 • Published Mar 9
Merge and Conquer: Instructing Multilingual Models by Adding Target Language Weights Paper • 2603.28263 • Published Mar 30
Merge and Conquer: Instructing Multilingual Models by Adding Target Language Weights Paper • 2603.28263 • Published Mar 30
MrBERT: Modern Multilingual Encoders via Vocabulary, Domain, and Dimensional Adaptation Paper • 2602.21379 • Published Feb 24 • 1
Global PIQA: Evaluating Physical Commonsense Reasoning Across 100+ Languages and Cultures Paper • 2510.24081 • Published Oct 28, 2025 • 24
Instructing Large Language Models for Low-Resource Languages: A Systematic Study for Basque Paper • 2506.07597 • Published Jun 9, 2025
GuideX: Guided Synthetic Data Generation for Zero-Shot Information Extraction Paper • 2506.00649 • Published May 31, 2025 • 3
Data Contamination Report from the 2024 CONDA Shared Task Paper • 2407.21530 • Published Jul 31, 2024 • 10
Latxa: An Open Language Model and Evaluation Suite for Basque Paper • 2403.20266 • Published Mar 29, 2024 • 4
Event Extraction in Basque: Typologically motivated Cross-Lingual Transfer-Learning Analysis Paper • 2404.06392 • Published Apr 9, 2024
Latxa: An Open Language Model and Evaluation Suite for Basque Paper • 2403.20266 • Published Mar 29, 2024 • 4
IXA/Cogcomp at SemEval-2023 Task 2: Context-enriched Multilingual Named Entity Recognition using Knowledge Bases Paper • 2304.10637 • Published Apr 20, 2023
NLP Evaluation in trouble: On the Need to Measure LLM Data Contamination for each Benchmark Paper • 2310.18018 • Published Oct 27, 2023 • 1