Kamer commited on
Commit
c3b2980
·
1 Parent(s): 251b94c

End of training

Browse files
Files changed (1) hide show
  1. README.md +81 -0
README.md ADDED
@@ -0,0 +1,81 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: apache-2.0
3
+ base_model: distilbert-base-uncased
4
+ tags:
5
+ - generated_from_trainer
6
+ metrics:
7
+ - accuracy
8
+ model-index:
9
+ - name: DayOne
10
+ results: []
11
+ ---
12
+
13
+ <!-- This model card has been generated automatically according to the information the Trainer had access to. You
14
+ should probably proofread and complete it, then remove this comment. -->
15
+
16
+ # DayOne
17
+
18
+ This model is a fine-tuned version of [distilbert-base-uncased](https://huggingface.co/distilbert-base-uncased) on the None dataset.
19
+ It achieves the following results on the evaluation set:
20
+ - Loss: 0.5286
21
+ - Accuracy: 0.8686
22
+ - F1 Macro: 0.6109
23
+ - F1 Class 0: 0.9182
24
+ - F1 Class 1: 0.0
25
+ - F1 Class 2: 0.8817
26
+ - F1 Class 3: 0.9091
27
+ - F1 Class 4: 0.7556
28
+ - F1 Class 5: 0.6667
29
+ - F1 Class 6: 0.6897
30
+ - F1 Class 7: 0.9701
31
+ - F1 Class 8: 0.8889
32
+ - F1 Class 9: 0.7500
33
+ - F1 Class 10: 0.8926
34
+ - F1 Class 11: 0.0
35
+ - F1 Class 12: 0.7888
36
+ - F1 Class 13: 0.0
37
+ - F1 Class 14: 0.8213
38
+ - F1 Class 15: 0.0
39
+ - F1 Class 16: 0.0
40
+ - F1 Class 17: 0.9772
41
+ - F1 Class 18: 0.8381
42
+ - F1 Class 19: 0.4706
43
+
44
+ ## Model description
45
+
46
+ More information needed
47
+
48
+ ## Intended uses & limitations
49
+
50
+ More information needed
51
+
52
+ ## Training and evaluation data
53
+
54
+ More information needed
55
+
56
+ ## Training procedure
57
+
58
+ ### Training hyperparameters
59
+
60
+ The following hyperparameters were used during training:
61
+ - learning_rate: 2e-05
62
+ - train_batch_size: 16
63
+ - eval_batch_size: 16
64
+ - seed: 42
65
+ - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
66
+ - lr_scheduler_type: linear
67
+ - num_epochs: 2
68
+
69
+ ### Training results
70
+
71
+ | Training Loss | Epoch | Step | Validation Loss | Accuracy | F1 Macro | F1 Class 0 | F1 Class 1 | F1 Class 2 | F1 Class 3 | F1 Class 4 | F1 Class 5 | F1 Class 6 | F1 Class 7 | F1 Class 8 | F1 Class 9 | F1 Class 10 | F1 Class 11 | F1 Class 12 | F1 Class 13 | F1 Class 14 | F1 Class 15 | F1 Class 16 | F1 Class 17 | F1 Class 18 | F1 Class 19 |
72
+ |:-------------:|:-----:|:----:|:---------------:|:--------:|:--------:|:----------:|:----------:|:----------:|:----------:|:----------:|:----------:|:----------:|:----------:|:----------:|:----------:|:-----------:|:-----------:|:-----------:|:-----------:|:-----------:|:-----------:|:-----------:|:-----------:|:-----------:|:-----------:|
73
+ | 0.9508 | 1.77 | 1000 | 0.5343 | 0.8708 | 0.6138 | 0.9245 | 0.0 | 0.8938 | 0.9091 | 0.7835 | 0.6966 | 0.6947 | 0.9762 | 0.8889 | 0.7723 | 0.8896 | 0.0 | 0.7932 | 0.0 | 0.8194 | 0.0 | 0.0 | 0.9772 | 0.7857 | 0.4706 |
74
+
75
+
76
+ ### Framework versions
77
+
78
+ - Transformers 4.32.0
79
+ - Pytorch 2.0.1+cu117
80
+ - Datasets 2.14.4
81
+ - Tokenizers 0.13.3