UTTER - Unified Transcription and Translation for Extended Reality

community

https://he-utter.eu/

https://github.com/utter-project

Activity Feed Request to join this org

AI & ML interests

Multimodality, Large Language Models, Speech Processing

Recent Activity

andre-martins new activity 1 day ago

utter-project/EuroMoE-2.6B-A0.6B-Instruct-2512:Please consider adding benchmarks

pinzhenchen authored a paper 5 days ago

CHARM: Calibrating Reward Models With Chatbot Arena Scores

pinzhenchen authored a paper 5 days ago

XL-Instruct: Synthetic Data for Cross-Lingual Open-Ended Generation

View all activity

in utter-project/EuroMoE-2.6B-A0.6B-Instruct-2512 1 day ago

Please consider adding benchmarks

#3 opened 1 day ago by

authored 6 papers 5 days ago

CHARM: Calibrating Reward Models With Chatbot Arena Scores

Paper • 2504.10045 • Published Apr 14, 2025

XL-Instruct: Synthetic Data for Cross-Lingual Open-Ended Generation

Paper • 2503.22973 • Published Mar 29, 2025

DocHPLT: A Massively Multilingual Document-Level Translation Dataset

Paper • 2508.13079 • Published Aug 18, 2025 • 1

MatheMagic: Generating Dynamic Mathematics Benchmarks Robust to Memorization

Paper • 2510.05962 • Published Oct 7, 2025

HPLT 3.0: Very Large-Scale Multilingual Resources for LLM and MT. Mono- and Bi-lingual Data, Multilingual Evaluation, and Pre-Trained Models

Paper • 2511.01066 • Published Nov 2, 2025 • 2

Reinforcement Learning Elicits Contextual Learning of Unseen Language Translation

Paper • 2606.06428 • Published 21 days ago • 25

in utter-project/mHuBERT-147 23 days ago

Fix YAML language metadata issue for Norwegian (no)

#16 opened 2 months ago by

authored a paper about 1 month ago

Combining On-Policy Optimization and Distillation for Long-Context Reasoning in Large Language Models

Paper • 2605.12227 • Published May 12 • 1

updated a dataset about 1 month ago

utter-project/LongBlocks

Viewer • Updated May 13 • 194k • 1.48k • 7

published a dataset about 1 month ago

utter-project/LongBlocks

Viewer • Updated May 13 • 194k • 1.48k • 7

authored a paper 2 months ago

BERT-as-a-Judge: A Robust Alternative to Lexical Methods for Efficient Reference-Based LLM Evaluation

Paper • 2604.09497 • Published Apr 10 • 29

authored a paper 2 months ago

BERT-as-a-Judge: A Robust Alternative to Lexical Methods for Efficient Reference-Based LLM Evaluation

Paper • 2604.09497 • Published Apr 10 • 29

submitted a paper to Daily Papers 2 months ago

BERT-as-a-Judge: A Robust Alternative to Lexical Methods for Efficient Reference-Based LLM Evaluation

Paper • 2604.09497 • Published Apr 10 • 29

submitted a paper to Daily Papers 3 months ago

BidirLM: From Text to Omnimodal Bidirectional Encoders by Adapting and Composing Causal LLMs

Paper • 2604.02045 • Published Apr 2 • 38

authored 2 papers 3 months ago

EuroLLM-22B: Technical Report

Paper • 2602.05879 • Published Feb 5 • 3

BidirLM: From Text to Omnimodal Bidirectional Encoders by Adapting and Composing Causal LLMs

Paper • 2604.02045 • Published Apr 2 • 38

in utter-project/mHuBERT-147 3 months ago

Error while loading the checkpoint with fairseq

#15 opened 4 months ago by

updated a model 4 months ago

utter-project/EuroMoE-2.6B-A0.6B-Instruct-2512

Text Generation • 3B • Updated Feb 15 • 542 • 12

in utter-project/EuroVLM-9B-Preview 4 months ago

Critical requirement for upcoming releases: Prioritization of European linguistic variants

#1 opened 4 months ago by