splade_unigram / README.md
HeyDunaX's picture
add model card
d978277 verified
metadata
language:
  - vi
  - ede
tags:
  - splade
  - unigram
  - sentencepiece
  - information-retrieval
  - cross-lingual-retrieval
  - EViRAL

SPLADE + Unigram Tokenizer — EViRAL

Cross-lingual retrieval model for:

Ede query → Vietnamese passage retrieval

Files

File Description
mlm.pt MLM pre-trained SPLADE encoder
align.pt Cross-lingual aligned encoder
finetune.pt Fine-tuned retrieval encoder
spm_tokenizer/spm.model SentencePiece unigram model
spm_tokenizer/spm.vocab SentencePiece vocabulary

Tokenizer

Tokenizer type: SentencePiece Unigram