avkumararun commited on
Commit
863e33f
·
verified ·
1 Parent(s): a7e9a2a

avkumararun/bert-tiny

Browse files
Files changed (3) hide show
  1. README.md +57 -12
  2. model.safetensors +1 -1
  3. training_args.bin +1 -1
README.md CHANGED
@@ -21,11 +21,11 @@ should probably proofread and complete it, then remove this comment. -->
21
 
22
  This model is a fine-tuned version of [prajjwal1/bert-tiny](https://huggingface.co/prajjwal1/bert-tiny) on the None dataset.
23
  It achieves the following results on the evaluation set:
24
- - Loss: 1.0985
25
- - Accuracy: 0.91
26
- - F1: 0.9096
27
- - Precision: 0.9139
28
- - Recall: 0.9123
29
 
30
  ## Model description
31
 
@@ -50,22 +50,67 @@ The following hyperparameters were used during training:
50
  - seed: 42
51
  - optimizer: Use OptimizerNames.ADAMW_TORCH_FUSED with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
52
  - lr_scheduler_type: linear
53
- - num_epochs: 5
54
 
55
  ### Training results
56
 
57
  | Training Loss | Epoch | Step | Validation Loss | Accuracy | F1 | Precision | Recall |
58
  |:-------------:|:-----:|:----:|:---------------:|:--------:|:------:|:---------:|:------:|
59
- | No log | 1.0 | 125 | 1.5109 | 0.59 | 0.6026 | 0.7073 | 0.5895 |
60
- | No log | 2.0 | 250 | 1.3482 | 0.828 | 0.8293 | 0.8313 | 0.8301 |
61
- | No log | 3.0 | 375 | 1.2057 | 0.882 | 0.8806 | 0.8837 | 0.8850 |
62
- | 1.4080 | 4.0 | 500 | 1.1231 | 0.906 | 0.9056 | 0.9099 | 0.9085 |
63
- | 1.4080 | 5.0 | 625 | 1.0985 | 0.91 | 0.9096 | 0.9139 | 0.9123 |
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
64
 
65
 
66
  ### Framework versions
67
 
68
  - Transformers 5.0.0
69
- - Pytorch 2.9.0+cpu
70
  - Datasets 4.0.0
71
  - Tokenizers 0.22.2
 
21
 
22
  This model is a fine-tuned version of [prajjwal1/bert-tiny](https://huggingface.co/prajjwal1/bert-tiny) on the None dataset.
23
  It achieves the following results on the evaluation set:
24
+ - Loss: 0.0081
25
+ - Accuracy: 1.0
26
+ - F1: 1.0
27
+ - Precision: 1.0
28
+ - Recall: 1.0
29
 
30
  ## Model description
31
 
 
50
  - seed: 42
51
  - optimizer: Use OptimizerNames.ADAMW_TORCH_FUSED with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
52
  - lr_scheduler_type: linear
53
+ - num_epochs: 50
54
 
55
  ### Training results
56
 
57
  | Training Loss | Epoch | Step | Validation Loss | Accuracy | F1 | Precision | Recall |
58
  |:-------------:|:-----:|:----:|:---------------:|:--------:|:------:|:---------:|:------:|
59
+ | No log | 1.0 | 125 | 1.4595 | 0.666 | 0.5919 | 0.7955 | 0.6412 |
60
+ | No log | 2.0 | 250 | 1.2117 | 0.894 | 0.8814 | 0.8937 | 0.8839 |
61
+ | No log | 3.0 | 375 | 0.9703 | 0.924 | 0.9164 | 0.9352 | 0.9149 |
62
+ | 1.2705 | 4.0 | 500 | 0.7647 | 0.934 | 0.9284 | 0.9428 | 0.9262 |
63
+ | 1.2705 | 5.0 | 625 | 0.5898 | 0.97 | 0.9664 | 0.9722 | 0.9659 |
64
+ | 1.2705 | 6.0 | 750 | 0.4600 | 0.97 | 0.9664 | 0.9722 | 0.9659 |
65
+ | 1.2705 | 7.0 | 875 | 0.3596 | 0.97 | 0.9664 | 0.9722 | 0.9659 |
66
+ | 0.5486 | 8.0 | 1000 | 0.2753 | 0.97 | 0.9664 | 0.9722 | 0.9659 |
67
+ | 0.5486 | 9.0 | 1125 | 0.1988 | 1.0 | 1.0 | 1.0 | 1.0 |
68
+ | 0.5486 | 10.0 | 1250 | 0.1469 | 1.0 | 1.0 | 1.0 | 1.0 |
69
+ | 0.5486 | 11.0 | 1375 | 0.1139 | 1.0 | 1.0 | 1.0 | 1.0 |
70
+ | 0.1935 | 12.0 | 1500 | 0.0904 | 1.0 | 1.0 | 1.0 | 1.0 |
71
+ | 0.1935 | 13.0 | 1625 | 0.0743 | 1.0 | 1.0 | 1.0 | 1.0 |
72
+ | 0.1935 | 14.0 | 1750 | 0.0630 | 1.0 | 1.0 | 1.0 | 1.0 |
73
+ | 0.1935 | 15.0 | 1875 | 0.0542 | 1.0 | 1.0 | 1.0 | 1.0 |
74
+ | 0.0781 | 16.0 | 2000 | 0.0473 | 1.0 | 1.0 | 1.0 | 1.0 |
75
+ | 0.0781 | 17.0 | 2125 | 0.0418 | 1.0 | 1.0 | 1.0 | 1.0 |
76
+ | 0.0781 | 18.0 | 2250 | 0.0374 | 1.0 | 1.0 | 1.0 | 1.0 |
77
+ | 0.0781 | 19.0 | 2375 | 0.0337 | 1.0 | 1.0 | 1.0 | 1.0 |
78
+ | 0.0449 | 20.0 | 2500 | 0.0305 | 1.0 | 1.0 | 1.0 | 1.0 |
79
+ | 0.0449 | 21.0 | 2625 | 0.0279 | 1.0 | 1.0 | 1.0 | 1.0 |
80
+ | 0.0449 | 22.0 | 2750 | 0.0256 | 1.0 | 1.0 | 1.0 | 1.0 |
81
+ | 0.0449 | 23.0 | 2875 | 0.0236 | 1.0 | 1.0 | 1.0 | 1.0 |
82
+ | 0.0305 | 24.0 | 3000 | 0.0219 | 1.0 | 1.0 | 1.0 | 1.0 |
83
+ | 0.0305 | 25.0 | 3125 | 0.0204 | 1.0 | 1.0 | 1.0 | 1.0 |
84
+ | 0.0305 | 26.0 | 3250 | 0.0190 | 1.0 | 1.0 | 1.0 | 1.0 |
85
+ | 0.0305 | 27.0 | 3375 | 0.0178 | 1.0 | 1.0 | 1.0 | 1.0 |
86
+ | 0.0224 | 28.0 | 3500 | 0.0167 | 1.0 | 1.0 | 1.0 | 1.0 |
87
+ | 0.0224 | 29.0 | 3625 | 0.0157 | 1.0 | 1.0 | 1.0 | 1.0 |
88
+ | 0.0224 | 30.0 | 3750 | 0.0149 | 1.0 | 1.0 | 1.0 | 1.0 |
89
+ | 0.0224 | 31.0 | 3875 | 0.0141 | 1.0 | 1.0 | 1.0 | 1.0 |
90
+ | 0.0181 | 32.0 | 4000 | 0.0134 | 1.0 | 1.0 | 1.0 | 1.0 |
91
+ | 0.0181 | 33.0 | 4125 | 0.0127 | 1.0 | 1.0 | 1.0 | 1.0 |
92
+ | 0.0181 | 34.0 | 4250 | 0.0121 | 1.0 | 1.0 | 1.0 | 1.0 |
93
+ | 0.0181 | 35.0 | 4375 | 0.0116 | 1.0 | 1.0 | 1.0 | 1.0 |
94
+ | 0.0141 | 36.0 | 4500 | 0.0111 | 1.0 | 1.0 | 1.0 | 1.0 |
95
+ | 0.0141 | 37.0 | 4625 | 0.0107 | 1.0 | 1.0 | 1.0 | 1.0 |
96
+ | 0.0141 | 38.0 | 4750 | 0.0103 | 1.0 | 1.0 | 1.0 | 1.0 |
97
+ | 0.0141 | 39.0 | 4875 | 0.0099 | 1.0 | 1.0 | 1.0 | 1.0 |
98
+ | 0.0120 | 40.0 | 5000 | 0.0096 | 1.0 | 1.0 | 1.0 | 1.0 |
99
+ | 0.0120 | 41.0 | 5125 | 0.0093 | 1.0 | 1.0 | 1.0 | 1.0 |
100
+ | 0.0120 | 42.0 | 5250 | 0.0091 | 1.0 | 1.0 | 1.0 | 1.0 |
101
+ | 0.0120 | 43.0 | 5375 | 0.0088 | 1.0 | 1.0 | 1.0 | 1.0 |
102
+ | 0.0108 | 44.0 | 5500 | 0.0087 | 1.0 | 1.0 | 1.0 | 1.0 |
103
+ | 0.0108 | 45.0 | 5625 | 0.0085 | 1.0 | 1.0 | 1.0 | 1.0 |
104
+ | 0.0108 | 46.0 | 5750 | 0.0083 | 1.0 | 1.0 | 1.0 | 1.0 |
105
+ | 0.0108 | 47.0 | 5875 | 0.0082 | 1.0 | 1.0 | 1.0 | 1.0 |
106
+ | 0.0096 | 48.0 | 6000 | 0.0082 | 1.0 | 1.0 | 1.0 | 1.0 |
107
+ | 0.0096 | 49.0 | 6125 | 0.0081 | 1.0 | 1.0 | 1.0 | 1.0 |
108
+ | 0.0096 | 50.0 | 6250 | 0.0081 | 1.0 | 1.0 | 1.0 | 1.0 |
109
 
110
 
111
  ### Framework versions
112
 
113
  - Transformers 5.0.0
114
+ - Pytorch 2.9.0+cu126
115
  - Datasets 4.0.0
116
  - Tokenizers 0.22.2
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:bfb86fba5d902b5a1747d4ea6191109b70adc575aef2ddbb70b45116886840d8
3
  size 17550852
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:f2530860f856d643308a74838d548237ef41ed84208c1b5909154573a466000e
3
  size 17550852
training_args.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:29da7030cf5f3ca4643176bcdd20063df80298be4e2d012719bb352af72527d8
3
  size 5201
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:861d4c7a1cc85622dce4475145c485ff7bbed18258de676f2782be919813af24
3
  size 5201