igor commited on
Commit
f115fb2
·
verified ·
1 Parent(s): cdf5b30

End of training

Browse files
Files changed (3) hide show
  1. README.md +9 -8
  2. model.safetensors +1 -1
  3. training_args.bin +1 -1
README.md CHANGED
@@ -18,13 +18,12 @@ should probably proofread and complete it, then remove this comment. -->
18
 
19
  This model is a fine-tuned version of [distilbert/distilbert-base-uncased](https://huggingface.co/distilbert/distilbert-base-uncased) on an unknown dataset.
20
  It achieves the following results on the evaluation set:
21
- - Loss: 0.8334
22
- - F1: 0.6221
23
 
24
  ## Model description
25
 
26
- WIP
27
- Currently seems to be pretty bad overfit. I have an idea where, data distribution is kind of off (need to drop unknown classifications), so next version will hopefully be better
28
 
29
  ## Intended uses & limitations
30
 
@@ -45,15 +44,17 @@ The following hyperparameters were used during training:
45
  - seed: 42
46
  - optimizer: Use OptimizerNames.ADAMW_TORCH with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
47
  - lr_scheduler_type: linear
48
- - num_epochs: 3.0
49
 
50
  ### Training results
51
 
52
  | Training Loss | Epoch | Step | Validation Loss | F1 |
53
  |:-------------:|:-----:|:----:|:---------------:|:------:|
54
- | 0.6289 | 1.0 | 1581 | 0.6139 | 0.6443 |
55
- | 0.5648 | 2.0 | 3162 | 0.6683 | 0.6474 |
56
- | 0.4054 | 3.0 | 4743 | 0.8334 | 0.6221 |
 
 
57
 
58
 
59
  ### Framework versions
 
18
 
19
  This model is a fine-tuned version of [distilbert/distilbert-base-uncased](https://huggingface.co/distilbert/distilbert-base-uncased) on an unknown dataset.
20
  It achieves the following results on the evaluation set:
21
+ - Loss: 1.9476
22
+ - F1: 0.6365
23
 
24
  ## Model description
25
 
26
+ More information needed
 
27
 
28
  ## Intended uses & limitations
29
 
 
44
  - seed: 42
45
  - optimizer: Use OptimizerNames.ADAMW_TORCH with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
46
  - lr_scheduler_type: linear
47
+ - num_epochs: 5
48
 
49
  ### Training results
50
 
51
  | Training Loss | Epoch | Step | Validation Loss | F1 |
52
  |:-------------:|:-----:|:----:|:---------------:|:------:|
53
+ | 0.6349 | 1.0 | 1539 | 0.6284 | 0.6389 |
54
+ | 0.5473 | 2.0 | 3078 | 0.6628 | 0.6696 |
55
+ | 0.3714 | 3.0 | 4617 | 0.9471 | 0.6583 |
56
+ | 0.2167 | 4.0 | 6156 | 1.4814 | 0.6357 |
57
+ | 0.1402 | 5.0 | 7695 | 1.9476 | 0.6365 |
58
 
59
 
60
  ### Framework versions
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:66164fc0f18f2f37af54def25f58b3662bda71decd8697869a6af3fcec014679
3
  size 267832560
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:0ec0f25bdc3e18cc2d06129a2ce49d48573dd7c3937e107c3bc9200a9723f855
3
  size 267832560
training_args.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:7bdc4f260e350111f202df0cfe63eafefb487feff349a5e965e96662cb340845
3
  size 5240
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:a957b3e4e7f9addc152545db3cf3a8da8cc25fd00e00e13c39b8c75447da44d7
3
  size 5240