akkaro commited on
Commit
1428880
·
verified ·
1 Parent(s): b8732ba

End of training

Browse files
README.md CHANGED
@@ -16,8 +16,8 @@ should probably proofread and complete it, then remove this comment. -->
16
 
17
  This model is a fine-tuned version of [microsoft/git-base](https://huggingface.co/microsoft/git-base) on an unknown dataset.
18
  It achieves the following results on the evaluation set:
19
- - Loss: 4.4131
20
- - Wer Score: 24.1207
21
 
22
  ## Model description
23
 
@@ -37,25 +37,24 @@ More information needed
37
 
38
  The following hyperparameters were used during training:
39
  - learning_rate: 5e-05
40
- - train_batch_size: 4
41
- - eval_batch_size: 4
42
  - seed: 42
43
  - gradient_accumulation_steps: 2
44
- - total_train_batch_size: 8
45
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
46
  - lr_scheduler_type: linear
47
- - num_epochs: 5
48
  - mixed_precision_training: Native AMP
49
 
50
  ### Training results
51
 
52
  | Training Loss | Epoch | Step | Validation Loss | Wer Score |
53
  |:-------------:|:------:|:----:|:---------------:|:---------:|
54
- | 6.6954 | 0.9474 | 9 | 6.1194 | 24.2241 |
55
- | 5.8759 | 1.8947 | 18 | 5.4065 | 24.1552 |
56
- | 5.2401 | 2.8421 | 27 | 4.8795 | 24.1379 |
57
- | 4.7997 | 3.7895 | 36 | 4.5462 | 24.1034 |
58
- | 4.5467 | 4.7368 | 45 | 4.4131 | 24.1207 |
59
 
60
 
61
  ### Framework versions
 
16
 
17
  This model is a fine-tuned version of [microsoft/git-base](https://huggingface.co/microsoft/git-base) on an unknown dataset.
18
  It achieves the following results on the evaluation set:
19
+ - Loss: 0.0573
20
+ - Wer Score: 0.6707
21
 
22
  ## Model description
23
 
 
37
 
38
  The following hyperparameters were used during training:
39
  - learning_rate: 5e-05
40
+ - train_batch_size: 8
41
+ - eval_batch_size: 8
42
  - seed: 42
43
  - gradient_accumulation_steps: 2
44
+ - total_train_batch_size: 16
45
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
46
  - lr_scheduler_type: linear
47
+ - num_epochs: 10
48
  - mixed_precision_training: Native AMP
49
 
50
  ### Training results
51
 
52
  | Training Loss | Epoch | Step | Validation Loss | Wer Score |
53
  |:-------------:|:------:|:----:|:---------------:|:---------:|
54
+ | 7.4065 | 2.1277 | 50 | 4.7483 | 20.4625 |
55
+ | 2.9029 | 4.2553 | 100 | 1.1386 | 12.1695 |
56
+ | 0.5188 | 6.3830 | 150 | 0.1472 | 0.7676 |
57
+ | 0.0871 | 8.5106 | 200 | 0.0573 | 0.6707 |
 
58
 
59
 
60
  ### Framework versions
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:23dda9102c7814f77d42aeefd08d5ef21e044202cab67a72c3be8987a0e19b01
3
  size 706516040
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:32339843a69f87e810c653e3e556dddbfd50349a9fa918af77f51019522d79a9
3
  size 706516040
runs/Oct01_07-37-45_cbf5861e1559/events.out.tfevents.1727768270.cbf5861e1559.6782.4 CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:9c6cc5b05d4d429b986eb2d20aad051d6b4c6a1ab4d1677debe21b39ea7b827d
3
- size 7247
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:205f916e9ca532e9015426cf02bc72f8a11bfe858c56adbbab89acb38fcbfa69
3
+ size 7601