andreaschari's picture
Create README.md
87fb80e verified
metadata
license: mit
datasets:
  - unicamp-dl/mmarco
language:
  - ru
base_model:
  - unicamp-dl/mt5-base-mmarco-v2

mt5-base Reranker RU mMARCO/v2 Transliterated Queries

This is a variation of Unicamp's mt5-base Reranker initially finetuned on mMARCOv/2.

The queries transliterated Russian to English text using uroman.

The model was used for the SIGIR 2025 Short paper: Lost in Transliteration: Bridging the Script Gap in Neural IR.