BERnaT: Basque Encoders for Representing Natural Textual Diversity Paper • 2512.03903 • Published Dec 3, 2025
BabyBabelLM: A Multilingual Benchmark of Developmentally Plausible Training Data Paper • 2510.10159 • Published Oct 11, 2025 • 3
Open Korean Historical Corpus: A Millennia-Scale Diachronic Collection of Public Domain Texts Paper • 2510.24541 • Published Oct 28, 2025
Dialogue Is Not Enough to Make a Communicative BabyLM (But Neither Is Developmentally Inspired Reinforcement Learning) Paper • 2510.20358 • Published Oct 23, 2025
BLiSS 1.0: Evaluating Bilingual Learner Competence in Second Language Small Language Models Paper • 2510.19419 • Published Oct 22, 2025 • 1
Teacher Demonstrations in a BabyLM's Zone of Proximal Development for Contingent Multi-Turn Interaction Paper • 2510.20411 • Published Oct 23, 2025 • 2
Are they lovers or friends? Evaluating LLMs' Social Reasoning in English and Korean Dialogues Paper • 2510.19028 • Published Oct 21, 2025 • 8