rogerpolo commited on
Commit
bd7c0f5
·
verified ·
1 Parent(s): 5272ece

Training complete

Browse files
Files changed (3) hide show
  1. README.md +27 -30
  2. model.safetensors +1 -1
  3. training_args.bin +1 -1
README.md CHANGED
@@ -6,8 +6,6 @@ tags:
6
  - generated_from_trainer
7
  metrics:
8
  - accuracy
9
- - precision
10
- - recall
11
  - f1
12
  model-index:
13
  - name: results
@@ -21,11 +19,9 @@ should probably proofread and complete it, then remove this comment. -->
21
 
22
  This model is a fine-tuned version of [distilbert/distilbert-base-uncased](https://huggingface.co/distilbert/distilbert-base-uncased) on an unknown dataset.
23
  It achieves the following results on the evaluation set:
24
- - Loss: 0.1795
25
- - Accuracy: 0.9418
26
- - Precision: 0.9421
27
- - Recall: 0.9418
28
- - F1: 0.9419
29
 
30
  ## Model description
31
 
@@ -52,33 +48,34 @@ The following hyperparameters were used during training:
52
  - total_train_batch_size: 16
53
  - optimizer: Use OptimizerNames.ADAMW_TORCH with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
54
  - lr_scheduler_type: linear
55
- - num_epochs: 3
56
  - mixed_precision_training: Native AMP
57
 
58
  ### Training results
59
 
60
- | Training Loss | Epoch | Step | Validation Loss | Accuracy | Precision | Recall | F1 |
61
- |:-------------:|:------:|:-----:|:---------------:|:--------:|:---------:|:------:|:------:|
62
- | 0.4176 | 0.0667 | 500 | 0.3007 | 0.9030 | 0.9031 | 0.9030 | 0.9027 |
63
- | 0.265 | 0.1333 | 1000 | 0.2526 | 0.9161 | 0.9161 | 0.9161 | 0.9159 |
64
- | 0.2408 | 0.2 | 1500 | 0.2476 | 0.9226 | 0.9245 | 0.9226 | 0.9224 |
65
- | 0.231 | 0.2667 | 2000 | 0.2341 | 0.9297 | 0.9311 | 0.9297 | 0.9295 |
66
- | 0.2335 | 0.3333 | 2500 | 0.2143 | 0.9297 | 0.9296 | 0.9297 | 0.9297 |
67
- | 0.2417 | 0.4 | 3000 | 0.2000 | 0.9325 | 0.9338 | 0.9325 | 0.9327 |
68
- | 0.208 | 0.4667 | 3500 | 0.2060 | 0.9314 | 0.9323 | 0.9314 | 0.9315 |
69
- | 0.2133 | 0.5333 | 4000 | 0.2011 | 0.9346 | 0.9349 | 0.9346 | 0.9347 |
70
- | 0.2042 | 0.6 | 4500 | 0.2035 | 0.9345 | 0.9366 | 0.9345 | 0.9346 |
71
- | 0.1947 | 0.6667 | 5000 | 0.1945 | 0.9392 | 0.9392 | 0.9392 | 0.9392 |
72
- | 0.1954 | 0.7333 | 5500 | 0.1993 | 0.9367 | 0.9388 | 0.9367 | 0.9368 |
73
- | 0.1958 | 0.8 | 6000 | 0.2001 | 0.9411 | 0.9414 | 0.9411 | 0.9412 |
74
- | 0.2002 | 0.8667 | 6500 | 0.1906 | 0.9397 | 0.9403 | 0.9397 | 0.9399 |
75
- | 0.1885 | 0.9333 | 7000 | 0.1898 | 0.9420 | 0.9422 | 0.9420 | 0.9420 |
76
- | 0.1979 | 1.0 | 7500 | 0.1795 | 0.9418 | 0.9421 | 0.9418 | 0.9419 |
77
- | 0.1492 | 1.0667 | 8000 | 0.1933 | 0.9459 | 0.9460 | 0.9459 | 0.9459 |
78
- | 0.148 | 1.1333 | 8500 | 0.1978 | 0.9426 | 0.9437 | 0.9426 | 0.9427 |
79
- | 0.137 | 1.2 | 9000 | 0.1941 | 0.9466 | 0.9469 | 0.9466 | 0.9467 |
80
- | 0.1433 | 1.2667 | 9500 | 0.1988 | 0.9453 | 0.9457 | 0.9453 | 0.9453 |
81
- | 0.1403 | 1.3333 | 10000 | 0.1926 | 0.9441 | 0.9448 | 0.9441 | 0.9441 |
 
82
 
83
 
84
  ### Framework versions
 
6
  - generated_from_trainer
7
  metrics:
8
  - accuracy
 
 
9
  - f1
10
  model-index:
11
  - name: results
 
19
 
20
  This model is a fine-tuned version of [distilbert/distilbert-base-uncased](https://huggingface.co/distilbert/distilbert-base-uncased) on an unknown dataset.
21
  It achieves the following results on the evaluation set:
22
+ - Loss: 0.1979
23
+ - Accuracy: 0.9434
24
+ - F1: 0.9435
 
 
25
 
26
  ## Model description
27
 
 
48
  - total_train_batch_size: 16
49
  - optimizer: Use OptimizerNames.ADAMW_TORCH with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
50
  - lr_scheduler_type: linear
51
+ - num_epochs: 2
52
  - mixed_precision_training: Native AMP
53
 
54
  ### Training results
55
 
56
+ | Training Loss | Epoch | Step | Validation Loss | Accuracy | F1 |
57
+ |:-------------:|:------:|:-----:|:---------------:|:--------:|:------:|
58
+ | 0.4206 | 0.0667 | 500 | 0.3065 | 0.9021 | 0.9019 |
59
+ | 0.2721 | 0.1333 | 1000 | 0.2626 | 0.9151 | 0.9149 |
60
+ | 0.2403 | 0.2 | 1500 | 0.2584 | 0.9192 | 0.9190 |
61
+ | 0.2314 | 0.2667 | 2000 | 0.2398 | 0.9267 | 0.9263 |
62
+ | 0.232 | 0.3333 | 2500 | 0.2190 | 0.9318 | 0.9319 |
63
+ | 0.246 | 0.4 | 3000 | 0.1979 | 0.9338 | 0.9340 |
64
+ | 0.2092 | 0.4667 | 3500 | 0.2066 | 0.9309 | 0.9310 |
65
+ | 0.2171 | 0.5333 | 4000 | 0.2058 | 0.9353 | 0.9353 |
66
+ | 0.2102 | 0.6 | 4500 | 0.1999 | 0.9368 | 0.9370 |
67
+ | 0.2 | 0.6667 | 5000 | 0.1967 | 0.9363 | 0.9363 |
68
+ | 0.1952 | 0.7333 | 5500 | 0.2025 | 0.9358 | 0.9359 |
69
+ | 0.1963 | 0.8 | 6000 | 0.2062 | 0.9374 | 0.9375 |
70
+ | 0.2025 | 0.8667 | 6500 | 0.1918 | 0.9386 | 0.9388 |
71
+ | 0.1839 | 0.9333 | 7000 | 0.1943 | 0.9413 | 0.9414 |
72
+ | 0.2008 | 1.0 | 7500 | 0.1766 | 0.9420 | 0.9420 |
73
+ | 0.1467 | 1.0667 | 8000 | 0.1948 | 0.9426 | 0.9426 |
74
+ | 0.1502 | 1.1333 | 8500 | 0.1960 | 0.9413 | 0.9414 |
75
+ | 0.1331 | 1.2 | 9000 | 0.1977 | 0.9443 | 0.9444 |
76
+ | 0.1421 | 1.2667 | 9500 | 0.2006 | 0.9428 | 0.9428 |
77
+ | 0.1375 | 1.3333 | 10000 | 0.1931 | 0.9437 | 0.9437 |
78
+ | 0.1375 | 1.4 | 10500 | 0.1979 | 0.9434 | 0.9435 |
79
 
80
 
81
  ### Framework versions
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:ae4472230eac49417440cdc0381066e1936439bbd9337cd5a364423281db3ccd
3
  size 267838720
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:25856dd2586cc7cf418ce69c130597d92468fcda1dbcfbd003cabcb17ba8cff6
3
  size 267838720
training_args.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:90ee064dc29029295a42e7a04c7e9f1204438e7419d4d36e1f27faeacbf465a3
3
  size 5240
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:8dd43b467ffeb9bc88d50f221b0315810ec2cf5bc7fa5921eb160320e9b3aeca
3
  size 5240