Necent commited on
Commit
4cf68c6
·
verified ·
1 Parent(s): dd45c80

Upload README.md with huggingface_hub

Browse files
Files changed (1) hide show
  1. README.md +46 -0
README.md ADDED
@@ -0,0 +1,46 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ language:
3
+ - en
4
+ - ru
5
+ tags:
6
+ - efficientrag
7
+ - multi-hop-qa
8
+ - token-classification
9
+ - deberta-v3
10
+ license: mit
11
+ base_model: microsoft/mdeberta-v3-base
12
+ ---
13
+
14
+ # EfficientRAG Filter (mdeberta-v3-base)
15
+
16
+ **Filter** component of [EfficientRAG](https://arxiv.org/abs/2408.04259) — constructs next-hop queries via token selection.
17
+
18
+ ## What it does
19
+
20
+ Given the original question + extracted useful tokens, the Filter selects which tokens to keep in the next retrieval query. This is extractive (no generation) — it picks words from the input.
21
+
22
+ ## Architecture
23
+
24
+ - Base: `microsoft/mdeberta-v3-base` (86M params, multilingual)
25
+ - Standard `DebertaV2ForTokenClassification` with 2 labels (keep/discard)
26
+
27
+ ## Training
28
+
29
+ | | |
30
+ |--|--|
31
+ | Data | 5,691 samples (HotpotQA EN + Dragon-derec RU) |
32
+ | Epochs | 2 |
33
+ | Batch size | 4 |
34
+ | LR | 1e-5 |
35
+ | Max length | 128 |
36
+ | Hardware | Apple M3 Pro, ~17 minutes |
37
+
38
+ ## Usage
39
+
40
+
41
+
42
+ ## Related
43
+
44
+ - Training data: [Necent/efficientrag-filter-training-data](https://huggingface.co/datasets/Necent/efficientrag-filter-training-data)
45
+ - Labeler model: [Necent/efficientrag-labeler-mdeberta-v3-base](https://huggingface.co/Necent/efficientrag-labeler-mdeberta-v3-base)
46
+ - Paper: [EfficientRAG (arXiv:2408.04259)](https://arxiv.org/abs/2408.04259)