File size: 3,821 Bytes
4478821
 
 
 
d5fb657
 
 
 
 
 
 
4478821
 
bf9c36b
 
 
 
 
 
 
 
 
 
d5fb657
bf9c36b
d5fb657
bf9c36b
d5fb657
 
bf9c36b
d5fb657
bf9c36b
d5fb657
92d1d2a
 
 
 
 
 
 
 
 
d5fb657
92d1d2a
d5fb657
92d1d2a
d5fb657
 
92d1d2a
d5fb657
92d1d2a
d5fb657
e49ab07
 
 
 
 
 
 
 
 
d5fb657
e49ab07
d5fb657
e49ab07
d5fb657
 
e49ab07
d5fb657
e49ab07
d5fb657
4478821
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
---
license: mit
tags:
- generated_from_trainer
datasets:
- squad_v2
- quoref
- adversarial_qa
- duorc
task:
- question-answering
model-index:
- name: rob-base-superqa
  results:
  - task:
      type: question-answering
      name: Question Answering
    dataset:
      name: adversarial_qa
      type: adversarial_qa
      config: adversarialQA
      split: validation
    metrics:
    - type: exact_match
      value: 43.8667
      name: Exact Match
      verified: true
      verifyToken: eyJhbGciOiJFZERTQSIsInR5cCI6IkpXVCJ9.eyJoYXNoIjoiYzIxMWZiZWM1MTJmMGIxM2I5NTFjNGI5OTJiNDdjODQ3NDNkYjRkYTI3ZmZkNGVmMGYzZDk5MTZhNDE4YzI1YiIsInZlcnNpb24iOjF9.QAj_iwD0yN2woSbGAN9xVRKoDKxldZbleFeJr77P2s7xWQBsKCuY0b5-2WIL79EcTCChvjNITeriPXqz8mGMAw
    - type: f1
      value: 55.135
      name: F1
      verified: true
      verifyToken: eyJhbGciOiJFZERTQSIsInR5cCI6IkpXVCJ9.eyJoYXNoIjoiNjJkMzZjNTVhZTI5OTVhNTU4NDcyMjM1ZWJiODVjNzBhODRmZjlmMjE0MDUzMmU4NzNlNzA5NjgyODdkNTJmZSIsInZlcnNpb24iOjF9.O0KoLquXYbF3P2PGCFW8bxYEVe_yDW-WzEqpOmbIs_e9v4tcygH19ZUYFjMDFSll91SPJ2oIbVovsUISYuknCg
  - task:
      type: question-answering
      name: Question Answering
    dataset:
      name: squad_v2
      type: squad_v2
      config: squad_v2
      split: validation
    metrics:
    - type: exact_match
      value: 79.2432
      name: Exact Match
      verified: true
      verifyToken: eyJhbGciOiJFZERTQSIsInR5cCI6IkpXVCJ9.eyJoYXNoIjoiNjBjZjhjMTMzMzZhOTg1OGYyZDY2MzZjYmQ4NmNlNWI5MWNmNTBiZjY1Njg0YTYyMmRlNzlkZDU1NTZjOWM5ZCIsInZlcnNpb24iOjF9.1vo9JoASJ_zvOVa4lTRMNPljUvMon-E6QOZ1n_KFQBMtRvRY883ECudhAzb5LGpLntyM2EN5bfyfTQ6dfjjsDg
    - type: f1
      value: 82.336
      name: F1
      verified: true
      verifyToken: eyJhbGciOiJFZERTQSIsInR5cCI6IkpXVCJ9.eyJoYXNoIjoiOWFiZGMyMzkwOTlkMWVkZmExNjdkZTM1YjRkYzRkZDlhOGZlMjEwMGNjNjJhYjM5MjZlNDI3ZDEyNmViOGYyOSIsInZlcnNpb24iOjF9.f3xlhop8hXWCCWFXWZgyK9r8Cy5KE3gPgYNV3bRN78teN_hjYH5sDl4wMTMcPU-bsPX70_wvsuvU-r95ByF4Bg
  - task:
      type: question-answering
      name: Question Answering
    dataset:
      name: quoref
      type: quoref
      config: default
      split: validation
    metrics:
    - type: exact_match
      value: 78.8581
      name: Exact Match
      verified: true
      verifyToken: eyJhbGciOiJFZERTQSIsInR5cCI6IkpXVCJ9.eyJoYXNoIjoiZjMyYTAxOWJhYTM5YWNmNGFhZDg3NTIwN2UxN2RhYzQxYzFiODJjYTcyZTk5MGMwODNhMzA3Nzc3MDQzYjcwMiIsInZlcnNpb24iOjF9.FSNswUf1Y5ZnlS0fSm-lxsA1klUphzfDhfj00U5benVd0QiYvyeqRclC7Pw8B3RV9Oe1cZzfeDDA5fXY2A5JBw
    - type: f1
      value: 82.8261
      name: F1
      verified: true
      verifyToken: eyJhbGciOiJFZERTQSIsInR5cCI6IkpXVCJ9.eyJoYXNoIjoiNWQyMTNhZTc0MTdiMzNiNzc3YzhkNTk5ZWRkMWZlYjc4ZGU3YTFkNDkyZDg0NWFiYzFhMGQyMzZjYjcwNTE1YSIsInZlcnNpb24iOjF9.9waqQm_EBPo41pdOMmoY6r_-K7-3zUxt1AB4ndHTY50S5k5yyub8NdCJz09hBhbRd1_-1t3UT5p8HnFjAjF9DQ
---

<!-- This model card has been generated automatically according to the information the Trainer had access to. You
should probably proofread and complete it, then remove this comment. -->

# rob-base-superqa

This model is a fine-tuned version of [roberta-base](https://huggingface.co/roberta-base) on the None dataset.

## Model description

More information needed

## Intended uses & limitations

More information needed

## Training and evaluation data

More information needed

## Training procedure

### Training hyperparameters

The following hyperparameters were used during training:
- learning_rate: 7e-05
- train_batch_size: 32
- eval_batch_size: 32
- seed: 42
- distributed_type: multi-GPU
- num_devices: 8
- total_train_batch_size: 256
- total_eval_batch_size: 256
- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-06
- lr_scheduler_type: linear
- num_epochs: 3.0

### Training results



### Framework versions

- Transformers 4.21.1
- Pytorch 1.11.0a0+gita4c10ee
- Datasets 2.4.0
- Tokenizers 0.12.1