bobbyw committed on
Commit e822587 · verified · 1 Parent(s): b5ab954

End of training

README.md CHANGED
@@ -20,11 +20,11 @@ should probably proofread and complete it, then remove this comment. -->
 
  This model is a fine-tuned version of [bobbyw/deberta-v3-large_faster_learning_v2](https://huggingface.co/bobbyw/deberta-v3-large_faster_learning_v2) on an unknown dataset.
  It achieves the following results on the evaluation set:
- - Loss: 0.0915
+ - Loss: 0.0991
  - Accuracy: 0.0248
- - F1: 0.0277
- - Precision: 0.0142
- - Recall: 0.6364
+ - F1: 0.0238
+ - Precision: 0.0122
+ - Recall: 0.5455
  - Learning Rate: 0.0
 
  ## Model description
@@ -44,26 +44,22 @@ More information needed
  ### Training hyperparameters
 
  The following hyperparameters were used during training:
- - learning_rate: 2e-06
+ - learning_rate: 1e-06
  - train_batch_size: 3
  - eval_batch_size: 3
  - seed: 42
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  - lr_scheduler_type: linear
- - num_epochs: 8
+ - num_epochs: 4
 
  ### Training results
 
  | Training Loss | Epoch | Step | Validation Loss | Accuracy | F1 | Precision | Recall | Rate |
  |:-------------:|:-----:|:----:|:---------------:|:--------:|:------:|:---------:|:------:|:------:|
- | 0.0652 | 1.0 | 689 | 0.0845 | 0.0198 | 0.0314 | 0.0160 | 0.7273 | 0.0018 |
- | 0.0644 | 2.0 | 1378 | 0.0888 | 0.0228 | 0.0296 | 0.0151 | 0.6818 | 0.0015 |
- | 0.0582 | 3.0 | 2067 | 0.0920 | 0.0238 | 0.0315 | 0.0161 | 0.7273 | 0.0012 |
- | 0.0536 | 4.0 | 2756 | 0.0849 | 0.0248 | 0.0277 | 0.0142 | 0.6364 | 0.001 |
- | 0.0559 | 5.0 | 3445 | 0.1012 | 0.0308 | 0.0298 | 0.0152 | 0.6818 | 0.0008 |
- | 0.0466 | 6.0 | 4134 | 0.0948 | 0.0268 | 0.0316 | 0.0161 | 0.7273 | 0.0005 |
- | 0.0436 | 7.0 | 4823 | 0.0957 | 0.0278 | 0.0297 | 0.0152 | 0.6818 | 0.0003 |
- | 0.0447 | 8.0 | 5512 | 0.0915 | 0.0248 | 0.0277 | 0.0142 | 0.6364 | 0.0 |
+ | 0.0174 | 1.0 | 689 | 0.1018 | 0.0238 | 0.0238 | 0.0122 | 0.5455 | 0.0008 |
+ | 0.019 | 2.0 | 1378 | 0.1014 | 0.0248 | 0.0258 | 0.0132 | 0.5909 | 0.0005 |
+ | 0.0182 | 3.0 | 2067 | 0.0979 | 0.0228 | 0.0238 | 0.0122 | 0.5455 | 0.0003 |
+ | 0.0171 | 4.0 | 2756 | 0.0991 | 0.0248 | 0.0238 | 0.0122 | 0.5455 | 0.0 |
 
 
  ### Framework versions
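
The README.md numbers in this commit can be sanity-checked with a short script. This is a minimal sketch, not part of the repository: it recomputes F1 as the harmonic mean of the reported precision and recall, and it computes the fraction of the initial learning rate that remains at each epoch boundary under a linear schedule. The helper names `f1_score` and `linear_decay_fraction` are hypothetical, and the schedule assumes zero warmup steps, which the diff does not state.

```python
def f1_score(precision: float, recall: float) -> float:
    """Harmonic mean of precision and recall (0.0 when both are zero)."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


def linear_decay_fraction(step: int, total_steps: int) -> float:
    """Fraction of the initial learning rate left at `step`.

    Assumes linear decay to zero with no warmup (an assumption; the
    README diff lists only `lr_scheduler_type: linear`).
    """
    return max(0.0, 1.0 - step / total_steps)


# Final evaluation metrics after this commit (copied from the README diff).
new_f1 = f1_score(precision=0.0122, recall=0.5455)
# Final evaluation metrics before this commit.
old_f1 = f1_score(precision=0.0142, recall=0.6364)

# New run: 689 steps per epoch for 4 epochs, as in the new results table.
steps_per_epoch = 689
num_epochs = 4
fractions = [
    linear_decay_fraction(epoch * steps_per_epoch, num_epochs * steps_per_epoch)
    for epoch in range(1, num_epochs + 1)
]

print(f"recomputed F1 after commit:  {new_f1:.4f}")
print(f"recomputed F1 before commit: {old_f1:.4f}")
print(f"remaining LR fraction per epoch: {fractions}")
```

The recomputed F1 values agree with the reported 0.0238 and 0.0277 to within the rounding of the four-decimal inputs. The per-epoch fractions have the same shape as the `Rate` column of the new table (0.0008, 0.0005, 0.0003, 0.0), although that column's scale differs from the listed `learning_rate` and the diff does not say why.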
model.safetensors CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:4d04d84fa8d8a9c9a4fdeecfae53f09c8a152453a41e46fdf197409154a409c9
+ oid sha256:b9411a2f0b5608ff3c03767c11d019c1f857a9987f849a2e17298cea09255d09
  size 1740120184
runs/Jun12_17-44-15_c86ebda74365/events.out.tfevents.1718214257.c86ebda74365.349.0 CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:8015b3cb56113e083c09d2f19601c02443725f9987c59cceda58deca0d33c84e
- size 8566
+ oid sha256:2ee52e90f8361e3a509d1e4d0bc22a2058c981b4f7079e116807645d29d03cef
+ size 9508