hiudev committed on
Commit 8028d45 · verified · 1 Parent(s): e199303

Initial release

Files changed (3)
  1. README.md +18 -16
  2. model.safetensors +1 -1
  3. training_args.bin +1 -1
README.md CHANGED
@@ -14,8 +14,8 @@ model-index:
  - name: banking77-deBERTa-v3-base
  results:
  - task:
- type: text-classification
  name: Text Classification
+ type: text-classification
  dataset:
  name: banking77
  type: banking77
@@ -23,9 +23,9 @@ model-index:
  split: train
  args: default
  metrics:
- - type: accuracy
- value: 0.9225387306346826
- name: Accuracy
+ - name: Accuracy
+ type: accuracy
+ value: 0.9195402298850575
  ---
 
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -35,14 +35,14 @@ should probably proofread and complete it, then remove this comment. -->
 
  This model is a fine-tuned version of [microsoft/deberta-v3-base](https://huggingface.co/microsoft/deberta-v3-base) on the banking77 dataset.
  It achieves the following results on the evaluation set:
- - Loss: 0.3388
- - Accuracy: 0.9225
- - F1 Macro: 0.9207
- - Precision Macro: 0.9261
- - Recall Macro: 0.9193
- - F1 Weighted: 0.9221
- - Precision Weighted: 0.9252
- - Recall Weighted: 0.9225
+ - Loss: 0.3281
+ - Accuracy: 0.9195
+ - F1 Macro: 0.9170
+ - Precision Macro: 0.9222
+ - Recall Macro: 0.9159
+ - F1 Weighted: 0.9194
+ - Precision Weighted: 0.9229
+ - Recall Weighted: 0.9195
 
  ## Model description
 
@@ -68,15 +68,17 @@ The following hyperparameters were used during training:
  - optimizer: Use OptimizerNames.ADAMW_TORCH with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
  - lr_scheduler_type: linear
  - lr_scheduler_warmup_steps: 500
- - num_epochs: 3
+ - num_epochs: 5
 
  ### Training results
 
  | Training Loss | Epoch | Step | Validation Loss | Accuracy | F1 Macro | Precision Macro | Recall Macro | F1 Weighted | Precision Weighted | Recall Weighted |
  |:-------------:|:-----:|:----:|:---------------:|:--------:|:--------:|:---------------:|:------------:|:-----------:|:------------------:|:---------------:|
- | 0.9407 | 1.0 | 501 | 0.8110 | 0.8351 | 0.8093 | 0.8304 | 0.8144 | 0.8263 | 0.8412 | 0.8351 |
- | 0.4485 | 2.0 | 1002 | 0.4066 | 0.9055 | 0.8958 | 0.9069 | 0.8941 | 0.9041 | 0.9093 | 0.9055 |
- | 0.2692 | 3.0 | 1503 | 0.3388 | 0.9225 | 0.9207 | 0.9261 | 0.9193 | 0.9221 | 0.9252 | 0.9225 |
+ | 3.4666 | 1.0 | 501 | 3.1762 | 0.3548 | 0.2479 | 0.3016 | 0.3195 | 0.2774 | 0.3421 | 0.3548 |
+ | 1.2538 | 2.0 | 1002 | 1.0122 | 0.8141 | 0.7625 | 0.8091 | 0.7795 | 0.7946 | 0.8291 | 0.8141 |
+ | 0.5576 | 3.0 | 1503 | 0.4823 | 0.8941 | 0.8797 | 0.9012 | 0.8786 | 0.8915 | 0.9021 | 0.8941 |
+ | 0.3544 | 4.0 | 2004 | 0.3625 | 0.9110 | 0.9090 | 0.9170 | 0.9084 | 0.9108 | 0.9172 | 0.9110 |
+ | 0.2603 | 5.0 | 2505 | 0.3281 | 0.9195 | 0.9170 | 0.9222 | 0.9159 | 0.9194 | 0.9229 | 0.9195 |
 
 
  ### Framework versions
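
Review note: the diff keeps `lr_scheduler_warmup_steps: 500` while raising `num_epochs` from 3 to 5, so the warmup covers a smaller share of the run. The sketch below works that out from the step counts in the training table (501 steps per epoch). It assumes the `linear` scheduler has the usual warmup-then-linear-decay shape, as in `get_linear_schedule_with_warmup` from transformers. The function names are illustrative and not taken from the repository. The peak learning rate is not visible in this diff, so only the multiplier applied to it is computed.

```python
# Illustrative sketch only: names below are hypothetical, not from the repo.
STEPS_PER_EPOCH = 501   # from the "Step" column: 501 optimizer steps per epoch
WARMUP_STEPS = 500      # lr_scheduler_warmup_steps in the model card


def linear_warmup_multiplier(step: int, warmup_steps: int, total_steps: int) -> float:
    """Fraction of the peak learning rate at a given optimizer step.

    Assumed shape: linear ramp from 0 to 1 over the warmup, then linear
    decay from 1 down to 0 at the final step.
    """
    if step < warmup_steps:
        return step / max(1, warmup_steps)
    return max(0.0, (total_steps - step) / max(1, total_steps - warmup_steps))


def summarize(num_epochs: int) -> tuple[int, float]:
    """Return (total optimizer steps, share of the run spent in warmup)."""
    total_steps = STEPS_PER_EPOCH * num_epochs
    return total_steps, WARMUP_STEPS / total_steps


if __name__ == "__main__":
    for epochs in (3, 5):
        total, warmup_share = summarize(epochs)
        end_of_epoch_1 = linear_warmup_multiplier(STEPS_PER_EPOCH, WARMUP_STEPS, total)
        print(f"epochs={epochs} total_steps={total} "
              f"warmup_share={warmup_share:.3f} "
              f"lr_multiplier_after_epoch_1={end_of_epoch_1:.4f}")
```

Under these assumptions the warmup takes roughly a third of the 3-epoch run and roughly a fifth of the 5-epoch run. In both runs the multiplier is still close to 1 at the end of the first epoch, so the schedule by itself does not account for the very different epoch-1 losses in the two tables.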
model.safetensors CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:497584b4f9c857dffca420184d39d6d5799a83b107eb9360749392c71f94aedc
+ oid sha256:17435a19f85b188ff17b784f1f203e711ca2d5d2f8b3ddd8da6f1ee881c5071c
  size 737949988
training_args.bin CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:b5fc97f368e1905e2351297a1c44a84a50e3a1e033694e4c9b1cc4e22191d7ac
+ oid sha256:42182bc423f5c98b3410e3dc5143cd47594da824a48eee325f672f64016ced02
  size 5240