Maziger1 commited on
Commit
cf17b51
·
verified ·
1 Parent(s): a7de90c

Completed training

Browse files
Files changed (1) hide show
  1. README.md +9 -9
README.md CHANGED
@@ -8,20 +8,20 @@ metrics:
8
  - accuracy
9
  - f1
10
  model-index:
11
- - name: classifier-chapter4
12
  results: []
13
  ---
14
 
15
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
16
  should probably proofread and complete it, then remove this comment. -->
17
 
18
- # classifier-chapter4
19
 
20
  This model is a fine-tuned version of [distilbert/distilbert-base-uncased](https://huggingface.co/distilbert/distilbert-base-uncased) on an unknown dataset.
21
  It achieves the following results on the evaluation set:
22
- - Loss: 0.2475
23
- - Accuracy: 0.9166
24
- - F1: 0.9165
25
 
26
  ## Model description
27
 
@@ -41,8 +41,8 @@ More information needed
41
 
42
  The following hyperparameters were used during training:
43
  - learning_rate: 5e-05
44
- - train_batch_size: 64
45
- - eval_batch_size: 64
46
  - seed: 42
47
  - optimizer: Use OptimizerNames.ADAMW_TORCH_FUSED with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
48
  - lr_scheduler_type: linear
@@ -52,8 +52,8 @@ The following hyperparameters were used during training:
52
 
53
  | Training Loss | Epoch | Step | Validation Loss | Accuracy | F1 |
54
  |:-------------:|:-----:|:----:|:---------------:|:--------:|:------:|
55
- | No log | 1.0 | 157 | 0.2769 | 0.9087 | 0.9083 |
56
- | No log | 2.0 | 314 | 0.2475 | 0.9166 | 0.9165 |
57
 
58
 
59
  ### Framework versions
 
8
  - accuracy
9
  - f1
10
  model-index:
11
+ - name: assignment1_DestilBert
12
  results: []
13
  ---
14
 
15
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
16
  should probably proofread and complete it, then remove this comment. -->
17
 
18
+ # assignment1_DestilBert
19
 
20
  This model is a fine-tuned version of [distilbert/distilbert-base-uncased](https://huggingface.co/distilbert/distilbert-base-uncased) on an unknown dataset.
21
  It achieves the following results on the evaluation set:
22
+ - Loss: 0.2442
23
+ - Accuracy: 0.9191
24
+ - F1: 0.9191
25
 
26
  ## Model description
27
 
 
41
 
42
  The following hyperparameters were used during training:
43
  - learning_rate: 5e-05
44
+ - train_batch_size: 32
45
+ - eval_batch_size: 32
46
  - seed: 42
47
  - optimizer: Use OptimizerNames.ADAMW_TORCH_FUSED with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
48
  - lr_scheduler_type: linear
 
52
 
53
  | Training Loss | Epoch | Step | Validation Loss | Accuracy | F1 |
54
  |:-------------:|:-----:|:----:|:---------------:|:--------:|:------:|
55
+ | No log | 1.0 | 313 | 0.2607 | 0.9118 | 0.9116 |
56
+ | 0.3049 | 2.0 | 626 | 0.2442 | 0.9191 | 0.9191 |
57
 
58
 
59
  ### Framework versions