CHARM: Calibrating Reward Models With Chatbot Arena Scores Paper • 2504.10045 • Published Apr 14, 2025
XL-Instruct: Synthetic Data for Cross-Lingual Open-Ended Generation Paper • 2503.22973 • Published Mar 29, 2025
DocHPLT: A Massively Multilingual Document-Level Translation Dataset Paper • 2508.13079 • Published Aug 18, 2025 • 1
MatheMagic: Generating Dynamic Mathematics Benchmarks Robust to Memorization Paper • 2510.05962 • Published Oct 7, 2025
HPLT 3.0: Very Large-Scale Multilingual Resources for LLM and MT. Mono- and Bi-lingual Data, Multilingual Evaluation, and Pre-Trained Models Paper • 2511.01066 • Published Nov 2, 2025 • 2
Reinforcement Learning Elicits Contextual Learning of Unseen Language Translation Paper • 2606.06428 • Published 21 days ago • 25
Combining On-Policy Optimization and Distillation for Long-Context Reasoning in Large Language Models Paper • 2605.12227 • Published May 12 • 1
BERT-as-a-Judge: A Robust Alternative to Lexical Methods for Efficient Reference-Based LLM Evaluation Paper • 2604.09497 • Published Apr 10 • 29
BERT-as-a-Judge: A Robust Alternative to Lexical Methods for Efficient Reference-Based LLM Evaluation Paper • 2604.09497 • Published Apr 10 • 29
BERT-as-a-Judge: A Robust Alternative to Lexical Methods for Efficient Reference-Based LLM Evaluation Paper • 2604.09497 • Published Apr 10 • 29
BidirLM: From Text to Omnimodal Bidirectional Encoders by Adapting and Composing Causal LLMs Paper • 2604.02045 • Published Apr 2 • 38
BidirLM: From Text to Omnimodal Bidirectional Encoders by Adapting and Composing Causal LLMs Paper • 2604.02045 • Published Apr 2 • 38
utter-project/EuroMoE-2.6B-A0.6B-Instruct-2512 Text Generation • 3B • Updated Feb 15 • 542 • 12