HeyDunaX commited on
Commit
d978277
·
verified ·
1 Parent(s): ee630d9

add model card

Browse files
Files changed (1) hide show
  1. README.md +34 -0
README.md ADDED
@@ -0,0 +1,34 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+
2
+ ---
3
+ language:
4
+ - vi
5
+ - ede
6
+
7
+ tags:
8
+ - splade
9
+ - unigram
10
+ - sentencepiece
11
+ - information-retrieval
12
+ - cross-lingual-retrieval
13
+ - EViRAL
14
+ ---
15
+
16
+ # SPLADE + Unigram Tokenizer — EViRAL
17
+
18
+ Cross-lingual retrieval model for:
19
+
20
+ Ede query → Vietnamese passage retrieval
21
+
22
+ ## Files
23
+
24
+ | File | Description |
25
+ |---|---|
26
+ | mlm.pt | MLM pre-trained SPLADE encoder |
27
+ | align.pt | Cross-lingual aligned encoder |
28
+ | finetune.pt | Fine-tuned retrieval encoder |
29
+ | spm_tokenizer/spm.model | SentencePiece unigram model |
30
+ | spm_tokenizer/spm.vocab | SentencePiece vocabulary |
31
+
32
+ ## Tokenizer
33
+
34
+ Tokenizer type: SentencePiece Unigram