emilnuutinen commited on
Commit
a5a9463
·
1 Parent(s): c3520df

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +60 -0
README.md ADDED
@@ -0,0 +1,60 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ datasets:
3
+ - squad_v2
4
+ language:
5
+ - fi
6
+ metrics:
7
+ - accuracy
8
+ - f1
9
+ ---
10
+ # bert-base-finnish-cased-v1 for QA
11
+
12
+ This is the [bert-base-finnish-cased-v1](https://huggingface.co/TurkuNLP/bert-base-finnish-cased-v1/tree/main) model,
13
+ fine-tuned using the [Finnish SQuAD2.0](https://huggingface.co/datasets/TurkuNLP/squad_v2_fi) dataset. It's been trained on
14
+ question-answer pairs, including unanswerable questions, for the task of Question Answering.
15
+
16
+ ## Overview
17
+
18
+ **Language model:** bert-base-finnish-v1
19
+
20
+ **Language:** Finnish
21
+
22
+ **Downstream-task:** Extractive QA
23
+
24
+ **Training data:** squad_v2_fi
25
+
26
+ **Eval data:** squad_v2_fi
27
+
28
+ ## Hyperparameters
29
+
30
+ ```
31
+ batch_size = 12
32
+ n_epochs = 2
33
+ base_LM_model = "bert-base-finnish-cased-v1"
34
+ max_seq_len = 384
35
+ learning_rate = 3e-5
36
+ doc_stride=128
37
+ ```
38
+
39
+ ## Performance
40
+
41
+ ```
42
+ "exact": 68.2424,
43
+ "f1": 73.6607,
44
+ "HasAns_exact": 58.3290,
45
+ "HasAns_f1": 69.3629,
46
+ "HasAns_total": 5817,
47
+ "NoAns_exact": 77.8073,
48
+ "NoAns_f1": 77.8073,
49
+ "NoAns_total": 6029,
50
+ "samples": 12138,
51
+ "total": 11846
52
+ ```
53
+
54
+ ## Comparison with other Finnish models
55
+
56
+ | model | F1 |
57
+ | ------------------------------------------ | ----: |
58
+ | TurkuNLP/bert-base-finnish-cased-squad2 | 73.66 |
59
+ | ilmariky/bert-base-finnish-cased-squad2-fi | 61.87 |
60
+