EfficientRAG Labeler (mdeberta-v3-base)

Labeler component of EfficientRAG — dual-headed DeBERTa model for multi-hop retrieval.

What it does

Given a query and a retrieved chunk, the Labeler:

  1. Sequence classification: Is this chunk relevant (CONTINUE) or irrelevant (TERMINATE)?
  2. Token classification: Which tokens in the chunk are useful for answering?

Architecture

  • Base: microsoft/mdeberta-v3-base (86M params, multilingual)
  • Custom dual head: DebertaForSequenceTokenClassification
    • Token head: binary per-token (useful/useless)
    • Sequence head: binary per-chunk (CONTINUE/TERMINATE)
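The dual-head design can be sketched as follows. This is a minimal illustration, not the released implementation: a tiny embedding layer stands in for the pretrained mDeBERTa backbone, and the first-token pooling for the sequence head is an assumption.

```python
import torch
import torch.nn as nn

class DebertaForSequenceTokenClassification(nn.Module):
    """Sketch of the dual-head labeler. A small embedding encoder stands in
    for microsoft/mdeberta-v3-base; the two heads are the point here."""

    def __init__(self, vocab_size=1000, hidden=64, num_labels=2):
        super().__init__()
        self.encoder = nn.Embedding(vocab_size, hidden)  # stand-in for the transformer
        self.token_head = nn.Linear(hidden, num_labels)  # per-token useful/useless
        self.seq_head = nn.Linear(hidden, num_labels)    # per-chunk CONTINUE/TERMINATE

    def forward(self, input_ids):
        hidden = self.encoder(input_ids)            # (batch, seq, hidden)
        token_logits = self.token_head(hidden)      # (batch, seq, 2)
        seq_logits = self.seq_head(hidden[:, 0])    # assumed first-token pooling -> (batch, 2)
        return seq_logits, token_logits
```

Both heads share the same encoder output, so one forward pass yields the chunk-level routing decision and the token-level usefulness labels together.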

Training

  Data        30,818 samples (HotpotQA EN + Dragon-derec RU)
  Epochs      2
  Batch size  4
  LR          5e-6
  Max length  384
  Hardware    Apple M3 Pro, ~3.4 hours
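The card does not spell out the training objective. A plausible step, given the two heads, sums a per-chunk and a per-token cross-entropy; the equal weighting and the `-100` padding convention are assumptions, not stated in the card.

```python
import torch
import torch.nn.functional as F

def joint_loss(seq_logits, token_logits, seq_labels, token_labels):
    """Assumed joint objective: chunk-level CE + token-level CE.

    seq_logits:   (batch, 2)        CONTINUE/TERMINATE scores
    token_logits: (batch, seq, 2)   useful/useless scores per token
    token_labels: (batch, seq)      with -100 marking padded positions
    """
    seq_loss = F.cross_entropy(seq_logits, seq_labels)
    token_loss = F.cross_entropy(
        token_logits.reshape(-1, token_logits.size(-1)),
        token_labels.reshape(-1),
        ignore_index=-100,  # skip padded tokens
    )
    return seq_loss + token_loss
```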

Usage
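The card ships no usage snippet. The sketch below shows only the post-processing flow on hypothetical logits (pure Python, no model download): the sequence head decides CONTINUE vs. TERMINATE, and tokens the token head marks useful extend the query for the next retrieval hop. The label order (TERMINATE=0, CONTINUE=1) and the next-hop query construction are assumptions.

```python
CONTINUE, TERMINATE = 1, 0  # assumed label order

def postprocess(query, chunk_tokens, seq_logits, token_logits):
    """Turn the two heads' logits into a routing decision and a next-hop query.

    seq_logits:   [terminate_score, continue_score] for the whole chunk.
    token_logits: one (useless_score, useful_score) pair per chunk token.
    """
    decision = "CONTINUE" if seq_logits[CONTINUE] > seq_logits[TERMINATE] else "TERMINATE"
    useful = [tok for tok, (useless, useful_score) in zip(chunk_tokens, token_logits)
              if useful_score > useless]
    # Assumed next-hop construction: append useful tokens to the query.
    next_query = (query + " " + " ".join(useful)) if decision == "CONTINUE" else None
    return decision, useful, next_query

decision, useful, next_query = postprocess(
    "who directed the film",
    ["the", "film", "was", "directed", "by", "Nolan"],
    seq_logits=[0.2, 1.3],
    token_logits=[(1.0, -1.0), (0.5, 0.1), (1.2, -0.3),
                  (-0.2, 0.9), (0.8, 0.4), (-1.0, 2.0)],
)
# decision == "CONTINUE", useful == ["directed", "Nolan"]
```

In the real pipeline the logits would come from the trained checkpoint's forward pass, and tokens would be tokenizer subwords rather than whole words.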

Results on DRAGON benchmark

  Metric            Baseline   EfficientRAG   Delta
  MRR (multi-hop)   0.736      0.798          +0.062
  MRR (overall)     0.783      0.822          +0.039
  Precision         0.187      0.582          +0.395
