HeyDunaX commited on
Commit
5edc04a
·
verified ·
1 Parent(s): 8f023d7

add model card

Browse files
Files changed (1) hide show
  1. README.md +40 -0
README.md ADDED
@@ -0,0 +1,40 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+
2
+ ---
3
+ language:
4
+ - vi
5
+ - ede
6
+
7
+ tags:
8
+ - splade
9
+ - morpheme-tokenizer
10
+ - sparse-retrieval
11
+ - cross-lingual-retrieval
12
+ - information-retrieval
13
+ - EViRAL
14
+ ---
15
+
16
+ # SPLADE + Morpheme Tokenizer — EViRAL
17
+
18
+ Cross-lingual retrieval model:
19
+
20
+ Ede query → Vietnamese passage retrieval
21
+
22
+ ## Architecture
23
+
24
+ - Backbone: 6-layer Transformer
25
+ - Retrieval head: SPLADE sparse expansion
26
+ - Tokenizer: Morpheme tokenizer v4
27
+
28
+ ## Files
29
+
30
+ | File | Description |
31
+ |---|---|
32
+ | mlm.pt | MLM pre-trained encoder |
33
+ | align.pt | Cross-lingual aligned encoder |
34
+ | finetune.pt | Fine-tuned retrieval encoder |
35
+
36
+ ## Notes
37
+
38
+ This model replaces vanilla dense retrieval
39
+ with SPLADE sparse lexical expansion while
40
+ keeping the same Transformer backbone.