DLL888 commited on
Commit
11d44e7
1 Parent(s): 127321e

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +19 -8
README.md CHANGED
@@ -12,15 +12,10 @@ probably proofread and complete it, then remove this comment. -->
12
 
13
  # DLL888/bert-base-uncased-squad
14
 
15
- This model is a fine-tuned version of [bert-base-uncased](https://huggingface.co/bert-base-uncased) on an unknown dataset.
16
  It achieves the following results on the evaluation set:
17
- - Train Loss: 0.8072
18
- - Train End Logits Accuracy: 0.7735
19
- - Train Start Logits Accuracy: 0.7320
20
- - Validation Loss: 0.9990
21
- - Validation End Logits Accuracy: 0.7302
22
- - Validation Start Logits Accuracy: 0.6983
23
- - Epoch: 1
24
 
25
  ## Model description
26
 
@@ -36,6 +31,22 @@ More information needed
36
 
37
  ## Training procedure
38
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
39
  ### Training hyperparameters
40
 
41
  The following hyperparameters were used during training:
 
12
 
13
  # DLL888/bert-base-uncased-squad
14
 
15
+ This model is a fine-tuned version of [bert-base-uncased](https://huggingface.co/bert-base-uncased) on [SQuAD](https://huggingface.co/datasets/squad) dataset.
16
  It achieves the following results on the evaluation set:
17
+ - Exact Match: 80.21759697256385
18
+ - F1: 87.77849998885436
 
 
 
 
 
19
 
20
  ## Model description
21
 
 
31
 
32
  ## Training procedure
33
 
34
+ ### Training Machine
35
+
36
+ Trained in Google Colab Pro with the following specs:
37
+ +-----------------------------------------------------------------------------+
38
+ | NVIDIA-SMI 460.32.03 Driver Version: 460.32.03 CUDA Version: 11.2 |
39
+ |-------------------------------+----------------------+----------------------+
40
+ | GPU Name Persistence-M| Bus-Id Disp.A | Volatile Uncorr. ECC |
41
+ | Fan Temp Perf Pwr:Usage/Cap| Memory-Usage | GPU-Util Compute M. |
42
+ | | | MIG M. |
43
+ |===============================+======================+======================|
44
+ | 0 A100-SXM4-40GB Off | 00000000:00:04.0 Off | 0 |
45
+ | N/A 34C P0 56W / 400W | 0MiB / 40536MiB | 0% Default |
46
+ | | | Disabled |
47
+ +-------------------------------+----------------------+----------------------+
48
+ Training took about 26 minutes for two epochs.
49
+
50
  ### Training hyperparameters
51
 
52
  The following hyperparameters were used during training: