RyyDer commited on
Commit
01f2bc0
verified
1 Parent(s): f52568c

End of training

Browse files
README.md CHANGED
@@ -16,7 +16,7 @@ should probably proofread and complete it, then remove this comment. -->
16
 
17
  This model is a fine-tuned version of [deepset/deberta-v3-large-squad2](https://huggingface.co/deepset/deberta-v3-large-squad2) on the None dataset.
18
  It achieves the following results on the evaluation set:
19
- - Loss: 1.0454
20
 
21
  ## Model description
22
 
@@ -44,18 +44,16 @@ The following hyperparameters were used during training:
44
  - optimizer: Use OptimizerNames.ADAMW_TORCH with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
45
  - lr_scheduler_type: linear
46
  - lr_scheduler_warmup_steps: 10
47
- - num_epochs: 5
48
  - mixed_precision_training: Native AMP
49
 
50
  ### Training results
51
 
52
  | Training Loss | Epoch | Step | Validation Loss |
53
  |:-------------:|:-----:|:----:|:---------------:|
54
- | 1.5817 | 1.0 | 1 | 2.5048 |
55
- | 1.5733 | 2.0 | 2 | 2.5048 |
56
- | 1.3207 | 3.0 | 3 | 2.1915 |
57
- | 1.3188 | 4.0 | 4 | 1.5524 |
58
- | 0.8137 | 5.0 | 5 | 1.0454 |
59
 
60
 
61
  ### Framework versions
 
16
 
17
  This model is a fine-tuned version of [deepset/deberta-v3-large-squad2](https://huggingface.co/deepset/deberta-v3-large-squad2) on the None dataset.
18
  It achieves the following results on the evaluation set:
19
+ - Loss: 2.1946
20
 
21
  ## Model description
22
 
 
44
  - optimizer: Use OptimizerNames.ADAMW_TORCH with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
45
  - lr_scheduler_type: linear
46
  - lr_scheduler_warmup_steps: 10
47
+ - num_epochs: 3
48
  - mixed_precision_training: Native AMP
49
 
50
  ### Training results
51
 
52
  | Training Loss | Epoch | Step | Validation Loss |
53
  |:-------------:|:-----:|:----:|:---------------:|
54
+ | 1.2435 | 1.0 | 1 | 2.5058 |
55
+ | 1.4247 | 2.0 | 2 | 2.5058 |
56
+ | 1.3968 | 3.0 | 3 | 2.1946 |
 
 
57
 
58
 
59
  ### Framework versions
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:d196dfafd38829c99e36e40c0e2f1fdb93073a72abc7c89ff147a1f8158e3e15
3
  size 1736105880
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:609ecd370caab676772e5ab997c9b6eec79bcf86ef0bb0f4788fcf05c188c422
3
  size 1736105880
runs/Jun02_10-09-13_75737878059b/events.out.tfevents.1748858957.75737878059b.634.0 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:685d0d57749ec2d3c21ec6ee0d904055701f9578be961a42ba87548d01503c2a
3
+ size 7080
runs/Jun02_10-11-31_75737878059b/events.out.tfevents.1748859093.75737878059b.634.1 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:845b8694b4b2ad8574e1a7d45b8cf023e29ddaf6cfee89d050e0c013aac6cce2
3
+ size 7080
training_args.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:2f2eb497b37edc8017a485e0cdbf38af4c9d78d502ffa4a3e0e9e641e0d500ff
3
  size 5304
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:4a0404426b6f6a84cc47dab0b5946311510a49cdfba63628d14721add478129d
3
  size 5304