metadata
language:
  - vi
  - ede
tags:
  - splade
  - morpheme-tokenizer
  - sparse-retrieval
  - cross-lingual-retrieval
  - information-retrieval
  - EViRAL

SPLADE + Morpheme Tokenizer — EViRAL

A cross-lingual retrieval model: queries are written in Ede, and the retrieved passages are in Vietnamese (Ede query → Vietnamese passage).

Architecture

  • Backbone: 6-layer Transformer
  • Retrieval head: SPLADE sparse expansion
  • Tokenizer: Morpheme tokenizer v4
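
As a rough illustration of what a SPLADE head computes, the sketch below turns per-token vocabulary logits into one sparse term-weight vector using log(1 + ReLU(logit)) followed by max pooling over tokens. It is a minimal pure-Python sketch, not this model's code: the function name `splade_pool` and the toy inputs are hypothetical, and the card does not say whether this model pools by max or by sum.

```python
import math

def splade_pool(token_logits, attention_mask):
    """Collapse per-token vocabulary logits into one sparse vector.

    token_logits: list of per-token lists, each holding one logit per vocab entry.
    attention_mask: list of 0/1 flags; tokens flagged 0 (padding) are skipped.
    Returns {vocab_id: weight} containing only strictly positive weights.
    """
    vocab_size = len(token_logits[0])
    weights = [0.0] * vocab_size
    for logits, keep in zip(token_logits, attention_mask):
        if not keep:
            continue
        for vocab_id, logit in enumerate(logits):
            # log(1 + ReLU(logit)): negative logits contribute nothing,
            # large logits are dampened by the logarithm.
            w = math.log1p(max(logit, 0.0))
            # Max pooling over the sequence (an assumption for this sketch).
            if w > weights[vocab_id]:
                weights[vocab_id] = w
    return {i: w for i, w in enumerate(weights) if w > 0.0}

# Toy input: three tokens, a vocabulary of four entries; the last token is padding.
logits = [
    [2.0, -1.0, 0.0, 0.5],
    [0.0, 3.0, -2.0, 0.5],
    [9.0, 9.0, 9.0, 9.0],
]
sparse_vec = splade_pool(logits, attention_mask=[1, 1, 0])
print(sorted(sparse_vec))  # [0, 1, 3]
```

Vocabulary entry 2 never receives a positive logit from an unmasked token, so it is dropped from the output; that is where the sparsity comes from.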

Files

| File | Description |
|---|---|
| mlm.pt | MLM pre-trained encoder |
| align.pt | Cross-lingual aligned encoder |
| finetune.pt | Fine-tuned retrieval encoder |

Notes

This model replaces vanilla dense retrieval with SPLADE sparse lexical expansion; the Transformer backbone is unchanged.
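
One practical consequence of sparse output is that passages can be served from an inverted index rather than a dense vector index. The sketch below shows the general idea, assuming passages and queries have already been encoded as `{term_id: weight}` dictionaries. The helper names `build_index` and `search` and the toy vectors are hypothetical and are not part of this repository.

```python
from collections import defaultdict

def build_index(passages):
    """Build an inverted index: term_id -> list of (passage_id, weight)."""
    index = defaultdict(list)
    for passage_id, vec in passages.items():
        for term_id, weight in vec.items():
            index[term_id].append((passage_id, weight))
    return index

def search(index, query_vec, top_k=10):
    """Score passages by sparse dot product, visiting only the query's terms."""
    scores = defaultdict(float)
    for term_id, q_weight in query_vec.items():
        for passage_id, p_weight in index.get(term_id, ()):
            scores[passage_id] += q_weight * p_weight
    # Highest score first; ties are broken by passage id for determinism.
    return sorted(scores.items(), key=lambda kv: (-kv[1], kv[0]))[:top_k]

# Toy sparse vectors standing in for encoded Vietnamese passages.
passages = {
    "p1": {0: 1.0, 2: 0.5},
    "p2": {1: 2.0},
    "p3": {0: 0.5, 1: 0.5},
}
index = build_index(passages)

# Toy sparse vector standing in for an encoded Ede query.
results = search(index, {0: 2.0, 1: 1.0}, top_k=3)
print(results)  # [('p1', 2.0), ('p2', 2.0), ('p3', 1.5)]
```

Only postings for terms with non-zero query weight are touched, so lookup cost scales with the number of active query terms rather than with the vocabulary size.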