SPLADE + Morpheme Tokenizer - EViRAL

Cross-lingual retrieval model:

Ede query → Vietnamese passage retrieval

Architecture

  • Backbone: 6-layer Transformer
  • Retrieval head: SPLADE sparse expansion
  • Tokenizer: Morpheme tokenizer v4

Files

File          Description
mlm.pt        MLM pre-trained encoder
align.pt      Cross-lingual aligned encoder
finetune.pt   Fine-tuned retrieval encoder

Notes

This model replaces vanilla dense retrieval with SPLADE sparse lexical expansion while keeping the same Transformer backbone.
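
As a rough illustration of what sparse lexical expansion means here, the sketch below turns per-token MLM logits into a single sparse vocabulary-weight vector and scores a query against a passage with a sparse dot product. It assumes the max-pooling SPLADE variant, log(1 + ReLU(logit)) maxed over token positions; this card does not state which pooling the model uses. The function names `splade_pool` and `sparse_dot` and the toy logits are hypothetical, and plain Python lists stand in for the encoder's tensor output.

```python
import math


def splade_pool(logits, attention_mask):
    """Collapse per-token MLM logits into one sparse vocabulary vector.

    logits: one row per input token, each row of length vocab_size.
    attention_mask: 1 for real tokens, 0 for padding.
    Returns {vocab_id: weight} with only the non-zero weights.
    (Assumed max-pooling variant; not confirmed by this model card.)
    """
    vocab_size = len(logits[0])
    weights = [0.0] * vocab_size
    for row, keep in zip(logits, attention_mask):
        if not keep:
            continue  # padding positions contribute nothing
        for j, logit in enumerate(row):
            # log(1 + ReLU(x)) dampens very large activations
            w = math.log1p(max(logit, 0.0))
            if w > weights[j]:
                weights[j] = w  # max over token positions
    return {j: w for j, w in enumerate(weights) if w > 0.0}


def sparse_dot(query_vec, passage_vec):
    """Relevance score: dot product over vocabulary ids present in both."""
    return sum(w * passage_vec[j] for j, w in query_vec.items() if j in passage_vec)


# Toy logits for a 4-entry vocabulary (illustrative values only).
query_logits = [[2.0, -1.0, 0.0, 0.5], [0.0, -3.0, 1.0, -0.2]]
passage_logits = [[1.0, 0.0, -2.0, 3.0], [9.0, 9.0, 9.0, 9.0]]

q = splade_pool(query_logits, [1, 1])
p = splade_pool(passage_logits, [1, 0])  # second row is padding
score = sparse_dot(q, p)
```

Because only non-zero vocabulary weights are kept, the resulting vectors can be stored in a standard inverted index, which is the practical difference from dense retrieval on the same backbone.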


Collection including NIRVLab/splade_morpheme