Add evaluation results on the adversarialQA config of adversarial_qa
#2
by
autoevaluator
HF Staff
- opened
README.md
CHANGED
|
@@ -11,6 +11,26 @@ datasets:
|
|
| 11 |
metrics:
|
| 12 |
- exact_match
|
| 13 |
- f1
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 14 |
---
|
| 15 |
|
| 16 |
# Model Overview
|
|
@@ -24,4 +44,4 @@ Evaluation data: SQuAD + AdversarialQA
|
|
| 24 |
Approx. 1 training epoch on the synthetic data and 2 training epochs on the manually-curated data.
|
| 25 |
|
| 26 |
# Additional Information
|
| 27 |
-
Please refer to https://arxiv.org/abs/2104.08678 for full details. You can interact with the model on Dynabench here: https://dynabench.org/models/109
|
|
|
|
| 11 |
metrics:
|
| 12 |
- exact_match
|
| 13 |
- f1
|
| 14 |
+
model-index:
|
| 15 |
+
- name: mbartolo/electra-large-synqa
|
| 16 |
+
results:
|
| 17 |
+
- task:
|
| 18 |
+
type: question-answering
|
| 19 |
+
name: Question Answering
|
| 20 |
+
dataset:
|
| 21 |
+
name: adversarial_qa
|
| 22 |
+
type: adversarial_qa
|
| 23 |
+
config: adversarialQA
|
| 24 |
+
split: validation
|
| 25 |
+
metrics:
|
| 26 |
+
- name: Exact Match
|
| 27 |
+
type: exact_match
|
| 28 |
+
value: 54.3667
|
| 29 |
+
verified: true
|
| 30 |
+
- name: F1
|
| 31 |
+
type: f1
|
| 32 |
+
value: 67.3589
|
| 33 |
+
verified: true
|
| 34 |
---
|
| 35 |
|
| 36 |
# Model Overview
|
|
|
|
| 44 |
Approx. 1 training epoch on the synthetic data and 2 training epochs on the manually-curated data.
|
| 45 |
|
| 46 |
# Additional Information
|
| 47 |
+
Please refer to https://arxiv.org/abs/2104.08678 for full details. You can interact with the model on Dynabench here: https://dynabench.org/models/109
|