File size: 528 Bytes
87fb80e |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 |
---
license: mit
datasets:
- unicamp-dl/mmarco
language:
- ru
base_model:
- unicamp-dl/mt5-base-mmarco-v2
---
# mt5-base Reranker RU mMARCO/v2 Transliterated Queries
This is a variation of Unicamp's [mt5-base Reranker](https://huggingface.co/unicamp-dl/mt5-base-mmarco-v2) initially finetuned on mMARCOv/2.
The queries transliterated Russian to English text using [uroman](https://github.com/isi-nlp/uroman).
The model was used for the SIGIR 2025 Short paper: Lost in Transliteration: Bridging the Script Gap in Neural IR.
|