File size: 700 Bytes
5edc04a | 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 |
---
language:
- vi
- ede
tags:
- splade
- morpheme-tokenizer
- sparse-retrieval
- cross-lingual-retrieval
- information-retrieval
- EViRAL
---
# SPLADE + Morpheme Tokenizer — EViRAL
Cross-lingual retrieval model:
Ede query → Vietnamese passage retrieval
## Architecture
- Backbone: 6-layer Transformer
- Retrieval head: SPLADE sparse expansion
- Tokenizer: Morpheme tokenizer v4
## Files
| File | Description |
|---|---|
| mlm.pt | MLM pre-trained encoder |
| align.pt | Cross-lingual aligned encoder |
| finetune.pt | Fine-tuned retrieval encoder |
## Notes
This model replaces vanilla dense retrieval
with SPLADE sparse lexical expansion while
keeping the same Transformer backbone.
|