---
language:
- vi
- ede
tags:
- cross-lingual-retrieval
- morpheme-tokenizer
- colbert
- EViRAL
---

# ColBERT Transformer + Morpheme Tokenizer v4 — EViRAL

Task: Ede query → Vietnamese passage retrieval

## Eval Results

| Metric  | Validation | Test   |
|---------|-----------|--------|
| nDCG@1  | 0.0004    | 0.0005 |
| nDCG@5  | 0.0011    | 0.0013 |
| nDCG@10 | 0.0017    | 0.0022 |
| MRR@10  | 0.0020    | 0.0023 |
| R@50    | 0.0202    | 0.0210 |
| R@100   | 0.0345    | 0.0374 |
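
The card does not say how these metrics were computed. As a reading aid, here is a minimal per-query sketch of nDCG@k, MRR@k and Recall@k (R@k), assuming binary relevance and a ranked list without duplicate passage ids. The function names and the toy ids are hypothetical and are not taken from the EViRAL evaluation code. The table values would correspond to averaging such per-query scores over all queries in a split.

```python
import math


def ndcg_at_k(ranked_ids, relevant_ids, k):
    """Binary-relevance nDCG@k for one query (assumed metric definition)."""
    # Discounted gain of every relevant passage found in the top k.
    dcg = sum(
        1.0 / math.log2(rank + 2)
        for rank, doc_id in enumerate(ranked_ids[:k])
        if doc_id in relevant_ids
    )
    # Ideal ranking places all relevant passages first.
    ideal_hits = min(len(relevant_ids), k)
    idcg = sum(1.0 / math.log2(rank + 2) for rank in range(ideal_hits))
    return dcg / idcg if idcg > 0 else 0.0


def mrr_at_k(ranked_ids, relevant_ids, k):
    """Reciprocal rank of the first relevant passage in the top k, else 0."""
    for rank, doc_id in enumerate(ranked_ids[:k], start=1):
        if doc_id in relevant_ids:
            return 1.0 / rank
    return 0.0


def recall_at_k(ranked_ids, relevant_ids, k):
    """Fraction of the relevant passages that appear in the top k."""
    if not relevant_ids:
        return 0.0
    hits = sum(1 for doc_id in ranked_ids[:k] if doc_id in relevant_ids)
    return hits / len(relevant_ids)


# Toy example: one Ede query whose only relevant Vietnamese passage is "p1".
ranked = ["p3", "p7", "p1"]   # hypothetical system output, best first
relevant = {"p1"}             # hypothetical gold label

print(ndcg_at_k(ranked, relevant, 1))
print(ndcg_at_k(ranked, relevant, 5))
print(mrr_at_k(ranked, relevant, 10))
print(recall_at_k(ranked, relevant, 50))
```

In this toy case the relevant passage sits at rank 3, so nDCG@1 is 0, nDCG@5 is 0.5, MRR@10 is 1/3 and R@50 is 1.0.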

## Checkpoints

| File | Description |
|---|---|
| mlm.pt | MLM ColBERT encoder weights |
| align.pt | cross-lingual aligned encoder |
| finetune.pt | contrastive fine-tuned encoder (best val) |