EfficientRAG Filter (mdeberta-v3-base)

Filter component of EfficientRAG — constructs next-hop queries via token selection.

What it does

Given the original question + extracted useful tokens, the Filter selects which tokens to keep in the next retrieval query. This is extractive (no generation) — it picks words from the input.

Architecture

  • Base: microsoft/mdeberta-v3-base (86M params, multilingual)
  • Standard DebertaV2ForTokenClassification with 2 labels (keep/discard)

Training

Data 5,691 samples (HotpotQA EN + Dragon-derec RU)
Epochs 2
Batch size 4
LR 1e-5
Max length 128
Hardware Apple M3 Pro, ~17 minutes

Usage

Related

Downloads last month
24
Safetensors
Model size
0.3B params
Tensor type
F32
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for Necent/efficientrag-filter-mdeberta-v3-base

Finetuned
(262)
this model

Paper for Necent/efficientrag-filter-mdeberta-v3-base