crodri commited on
Commit
2a4423d
·
1 Parent(s): 838fc6d

updated VilaQUAD

Browse files
Files changed (1) hide show
  1. README.md +3 -0
README.md CHANGED
@@ -138,6 +138,8 @@ It contains the following tasks and their related datasets:
138
 
139
  **[ViquiQuAD](https://doi.org/10.5281/zenodo.4562344)**: consisting of more than 15,000 questions outsourced from Catalan Wikipedia randomly chosen from a set of 596 articles that were originally written in Catalan.
140
 
 
 
141
  **[XQuAD](https://doi.org/10.5281/zenodo.4526223)**: the Catalan translation of XQuAD, a multilingual collection of manual translations of 1,190 question-answer pairs from English Wikipedia used only as a _test set_
142
 
143
  Here are the train/dev/test splits of the datasets:
@@ -149,6 +151,7 @@ Here are the train/dev/test splits of the datasets:
149
  | STS | 3,073 | 2,073 | 500 | 500 |
150
  | TC (TeCla) | 137,775 | 110,203 | 13,786 | 13,786|
151
  | QA (ViquiQuAD) | 14,239 | 11,255 | 1,492 | 1,429 |
 
152
 
153
  ### Evaluation Results
154
 
 
138
 
139
  **[ViquiQuAD](https://doi.org/10.5281/zenodo.4562344)**: consisting of more than 15,000 questions outsourced from Catalan Wikipedia randomly chosen from a set of 596 articles that were originally written in Catalan.
140
 
141
+ **[VilaQuAD] (https://doi.org/10.5281/zenodo.4562337)** contains 6282 pairs of questions and answers, outsourced from 2095 Catalan language articles from VilaWeb newswire text.
142
+
143
  **[XQuAD](https://doi.org/10.5281/zenodo.4526223)**: the Catalan translation of XQuAD, a multilingual collection of manual translations of 1,190 question-answer pairs from English Wikipedia used only as a _test set_
144
 
145
  Here are the train/dev/test splits of the datasets:
 
151
  | STS | 3,073 | 2,073 | 500 | 500 |
152
  | TC (TeCla) | 137,775 | 110,203 | 13,786 | 13,786|
153
  | QA (ViquiQuAD) | 14,239 | 11,255 | 1,492 | 1,429 |
154
+ | QA (VilaQuAD) | 6,282 | 3,882 | 1,200 | 1,200 |
155
 
156
  ### Evaluation Results
157