Samples from the ToPXGen-LLaMA-4-Scout English to Xhosa set augmented with intermediate information generated by LLaMA-4-Scout.
AI & ML interests
NLP, Digital Humanities
Recent Activity
View all activity
Papers
Disentangling meaning from language in LLM-based machine translation
Gaperon: A Peppered English-French Generative Language Model Suite
Samples from the ToPXGen-LLaMA-4-Scout English to Xhosa set augmented with intermediate information generated by LLaMA-4-Scout.
Samples from the WMT19 English to Lithuanian set augmented with intermediate information generated by gemma-3-27b-it.
Collections of models trained on the TopXGen dataset.
Our French-English LLM suite (including Base and SFT models. All checkpoints are also included.