---
license: mit
datasets:
- unicamp-dl/mmarco
language:
- ru
base_model:
- unicamp-dl/mt5-base-mmarco-v2
---
# mt5-base Reranker RU mMARCO/v2 Transliterated Queries
This is a variation of Unicamp's [mt5-base Reranker](https://huggingface.co/unicamp-dl/mt5-base-mmarco-v2), initially fine-tuned on mMARCO/v2.
The queries were transliterated from Russian into Latin-script text using [uroman](https://github.com/isi-nlp/uroman).
The model was used for the SIGIR 2025 short paper "Lost in Transliteration: Bridging the Script Gap in Neural IR".
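
The following is a minimal sketch, not the authors' code, of how a query might be romanized and scored against a passage. It assumes the monoT5-style prompt `Query: ... Document: ... Relevant:` and the `▁yes`/`▁no` target tokens commonly used with the base mMARCO mT5 rerankers, as well as the `uroman` pip package's `Uroman().romanize_string` method. The helper names and the `model_id` argument are hypothetical; pass the repository ID of the checkpoint you want to score with.

```python
from typing import Callable


def build_reranker_input(query: str, passage: str,
                         romanize: Callable[[str], str]) -> str:
    """Romanize the query, then format a monoT5-style prompt (assumed format)."""
    return f"Query: {romanize(query)} Document: {passage} Relevant:"


def uroman_romanizer() -> Callable[[str], str]:
    """Return a romanizer backed by uroman (assumes `pip install uroman`)."""
    import uroman as ur  # imported lazily so the prompt helper has no dependencies
    return ur.Uroman().romanize_string


def relevance_score(query: str, passage: str, model_id: str,
                    romanize: Callable[[str], str]) -> float:
    """Log-probability of the 'yes' token at the first decoding step.

    Assumes `torch` and `transformers` are installed and that the checkpoint
    uses '▁no' / '▁yes' as its relevance tokens.
    """
    import torch
    from transformers import AutoTokenizer, MT5ForConditionalGeneration

    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = MT5ForConditionalGeneration.from_pretrained(model_id).eval()

    enc = tokenizer(build_reranker_input(query, passage, romanize),
                    return_tensors="pt", truncation=True, max_length=512)
    # Feed only the decoder start token to obtain first-step logits.
    start = torch.full((1, 1), model.config.decoder_start_token_id,
                       dtype=torch.long)
    with torch.no_grad():
        logits = model(**enc, decoder_input_ids=start).logits[0, -1]

    no_id, yes_id = tokenizer.convert_tokens_to_ids(["▁no", "▁yes"])
    # Normalize over the two relevance tokens only; index 1 is 'yes'.
    return torch.log_softmax(logits[[no_id, yes_id]], dim=0)[1].item()
```

Candidate passages for a query can then be sorted by `relevance_score` in descending order; for more than a handful of passages, load the model once and batch the inputs instead of reloading it per call.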