Fine-tuned Operations Receipt Model
Browse files- README.md +22 -61
- model.safetensors +1 -1
- training_args.bin +2 -2
README.md
CHANGED
|
@@ -16,7 +16,7 @@ should probably proofread and complete it, then remove this comment. -->
|
|
| 16 |
|
| 17 |
This model is a fine-tuned version of [DanSarm/receipt-core-model](https://huggingface.co/DanSarm/receipt-core-model) on an unknown dataset.
|
| 18 |
It achieves the following results on the evaluation set:
|
| 19 |
-
- Loss: 0.
|
| 20 |
|
| 21 |
## Model description
|
| 22 |
|
|
@@ -41,71 +41,32 @@ The following hyperparameters were used during training:
|
|
| 41 |
- seed: 42
|
| 42 |
- optimizer: Use adamw_torch with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
|
| 43 |
- lr_scheduler_type: linear
|
| 44 |
-
- num_epochs:
|
| 45 |
-
- mixed_precision_training: Native AMP
|
| 46 |
|
| 47 |
### Training results
|
| 48 |
|
| 49 |
| Training Loss | Epoch | Step | Validation Loss |
|
| 50 |
|:-------------:|:-----:|:----:|:---------------:|
|
| 51 |
-
| 0.
|
| 52 |
-
| 0.
|
| 53 |
-
| 0.
|
| 54 |
-
| 0.
|
| 55 |
-
| 0.
|
| 56 |
-
| 0.
|
| 57 |
-
| 0.
|
| 58 |
-
| 0.
|
| 59 |
-
| 0.
|
| 60 |
-
| 0.
|
| 61 |
-
| 0.
|
| 62 |
-
| 0.
|
| 63 |
-
| 0.
|
| 64 |
-
| 0.
|
| 65 |
-
| 0.
|
| 66 |
-
| 0.
|
| 67 |
-
| 0.
|
| 68 |
-
| 0.
|
| 69 |
-
| 0.
|
| 70 |
-
| 0.
|
| 71 |
-
| 0.0082 | 21.0 | 399 | 0.1225 |
|
| 72 |
-
| 0.0073 | 22.0 | 418 | 0.1210 |
|
| 73 |
-
| 0.0066 | 23.0 | 437 | 0.1199 |
|
| 74 |
-
| 0.0083 | 24.0 | 456 | 0.1170 |
|
| 75 |
-
| 0.0087 | 25.0 | 475 | 0.1172 |
|
| 76 |
-
| 0.0053 | 26.0 | 494 | 0.1160 |
|
| 77 |
-
| 0.0061 | 27.0 | 513 | 0.1178 |
|
| 78 |
-
| 0.0045 | 28.0 | 532 | 0.1169 |
|
| 79 |
-
| 0.0048 | 29.0 | 551 | 0.1192 |
|
| 80 |
-
| 0.0034 | 30.0 | 570 | 0.1219 |
|
| 81 |
-
| 0.0032 | 31.0 | 589 | 0.1194 |
|
| 82 |
-
| 0.0038 | 32.0 | 608 | 0.1230 |
|
| 83 |
-
| 0.0036 | 33.0 | 627 | 0.1241 |
|
| 84 |
-
| 0.0036 | 34.0 | 646 | 0.1235 |
|
| 85 |
-
| 0.0039 | 35.0 | 665 | 0.1178 |
|
| 86 |
-
| 0.0025 | 36.0 | 684 | 0.1174 |
|
| 87 |
-
| 0.004 | 37.0 | 703 | 0.1146 |
|
| 88 |
-
| 0.003 | 38.0 | 722 | 0.1148 |
|
| 89 |
-
| 0.002 | 39.0 | 741 | 0.1186 |
|
| 90 |
-
| 0.0026 | 40.0 | 760 | 0.1137 |
|
| 91 |
-
| 0.0019 | 41.0 | 779 | 0.1134 |
|
| 92 |
-
| 0.0018 | 42.0 | 798 | 0.1135 |
|
| 93 |
-
| 0.0014 | 43.0 | 817 | 0.1139 |
|
| 94 |
-
| 0.0019 | 44.0 | 836 | 0.1189 |
|
| 95 |
-
| 0.0012 | 45.0 | 855 | 0.1153 |
|
| 96 |
-
| 0.0017 | 46.0 | 874 | 0.1155 |
|
| 97 |
-
| 0.0019 | 47.0 | 893 | 0.1181 |
|
| 98 |
-
| 0.0013 | 48.0 | 912 | 0.1189 |
|
| 99 |
-
| 0.0012 | 49.0 | 931 | 0.1231 |
|
| 100 |
-
| 0.0011 | 50.0 | 950 | 0.1211 |
|
| 101 |
-
| 0.0021 | 51.0 | 969 | 0.1217 |
|
| 102 |
-
| 0.002 | 52.0 | 988 | 0.1235 |
|
| 103 |
-
| 0.0022 | 53.0 | 1007 | 0.1193 |
|
| 104 |
-
| 0.0022 | 54.0 | 1026 | 0.1185 |
|
| 105 |
-
| 0.002 | 55.0 | 1045 | 0.1230 |
|
| 106 |
-
| 0.0014 | 56.0 | 1064 | 0.1246 |
|
| 107 |
-
| 0.0012 | 57.0 | 1083 | 0.1249 |
|
| 108 |
-
| 0.0014 | 58.0 | 1102 | 0.1278 |
|
| 109 |
|
| 110 |
|
| 111 |
### Framework versions
|
|
|
|
| 16 |
|
| 17 |
This model is a fine-tuned version of [DanSarm/receipt-core-model](https://huggingface.co/DanSarm/receipt-core-model) on an unknown dataset.
|
| 18 |
It achieves the following results on the evaluation set:
|
| 19 |
+
- Loss: 0.0950
|
| 20 |
|
| 21 |
## Model description
|
| 22 |
|
|
|
|
| 41 |
- seed: 42
|
| 42 |
- optimizer: Use adamw_torch with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
|
| 43 |
- lr_scheduler_type: linear
|
| 44 |
+
- num_epochs: 20
|
|
|
|
| 45 |
|
| 46 |
### Training results
|
| 47 |
|
| 48 |
| Training Loss | Epoch | Step | Validation Loss |
|
| 49 |
|:-------------:|:-----:|:----:|:---------------:|
|
| 50 |
+
| 0.3537 | 1.0 | 29 | 0.2147 |
|
| 51 |
+
| 0.1355 | 2.0 | 58 | 0.1529 |
|
| 52 |
+
| 0.0971 | 3.0 | 87 | 0.1183 |
|
| 53 |
+
| 0.0765 | 4.0 | 116 | 0.1090 |
|
| 54 |
+
| 0.0589 | 5.0 | 145 | 0.1075 |
|
| 55 |
+
| 0.0538 | 6.0 | 174 | 0.1000 |
|
| 56 |
+
| 0.0424 | 7.0 | 203 | 0.1012 |
|
| 57 |
+
| 0.0363 | 8.0 | 232 | 0.0978 |
|
| 58 |
+
| 0.0329 | 9.0 | 261 | 0.0995 |
|
| 59 |
+
| 0.0289 | 10.0 | 290 | 0.0950 |
|
| 60 |
+
| 0.0259 | 11.0 | 319 | 0.0972 |
|
| 61 |
+
| 0.0246 | 12.0 | 348 | 0.0980 |
|
| 62 |
+
| 0.0204 | 13.0 | 377 | 0.0960 |
|
| 63 |
+
| 0.0195 | 14.0 | 406 | 0.0957 |
|
| 64 |
+
| 0.0185 | 15.0 | 435 | 0.0955 |
|
| 65 |
+
| 0.0193 | 16.0 | 464 | 0.0963 |
|
| 66 |
+
| 0.0157 | 17.0 | 493 | 0.0959 |
|
| 67 |
+
| 0.0149 | 18.0 | 522 | 0.0967 |
|
| 68 |
+
| 0.0145 | 19.0 | 551 | 0.0973 |
|
| 69 |
+
| 0.0138 | 20.0 | 580 | 0.0967 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 70 |
|
| 71 |
|
| 72 |
### Framework versions
|
model.safetensors
CHANGED
|
@@ -1,3 +1,3 @@
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:
|
| 3 |
size 891644712
|
|
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:f29f5e53eb22d84a051ab40726976e6d596a5ff5f1bb9249024a33ca7e088f12
|
| 3 |
size 891644712
|
training_args.bin
CHANGED
|
@@ -1,3 +1,3 @@
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:
|
| 3 |
-
size
|
|
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:361394d111e45d4e9e6261298d94d4ff388236b2471bec8a993412e9c7354a85
|
| 3 |
+
size 5496
|