End of training
Browse files- README.md +106 -2
- model.safetensors +1 -1
README.md
CHANGED
|
@@ -1,7 +1,7 @@
|
|
| 1 |
---
|
| 2 |
library_name: transformers
|
| 3 |
license: apache-2.0
|
| 4 |
-
base_model:
|
| 5 |
tags:
|
| 6 |
- generated_from_trainer
|
| 7 |
model-index:
|
|
@@ -14,7 +14,9 @@ should probably proofread and complete it, then remove this comment. -->
|
|
| 14 |
|
| 15 |
# practica_2
|
| 16 |
|
| 17 |
-
This model is a fine-tuned version of [
|
|
|
|
|
|
|
| 18 |
|
| 19 |
## Model description
|
| 20 |
|
|
@@ -44,6 +46,108 @@ The following hyperparameters were used during training:
|
|
| 44 |
|
| 45 |
### Training results
|
| 46 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 47 |
|
| 48 |
|
| 49 |
### Framework versions
|
|
|
|
| 1 |
---
|
| 2 |
library_name: transformers
|
| 3 |
license: apache-2.0
|
| 4 |
+
base_model: seayala/practica_2
|
| 5 |
tags:
|
| 6 |
- generated_from_trainer
|
| 7 |
model-index:
|
|
|
|
| 14 |
|
| 15 |
# practica_2
|
| 16 |
|
| 17 |
+
This model is a fine-tuned version of [seayala/practica_2](https://huggingface.co/seayala/practica_2) on the None dataset.
|
| 18 |
+
It achieves the following results on the evaluation set:
|
| 19 |
+
- Loss: 0.5834
|
| 20 |
|
| 21 |
## Model description
|
| 22 |
|
|
|
|
| 46 |
|
| 47 |
### Training results
|
| 48 |
|
| 49 |
+
| Training Loss | Epoch | Step | Validation Loss |
|
| 50 |
+
|:-------------:|:-----:|:----:|:---------------:|
|
| 51 |
+
| No log | 1.0 | 17 | 0.5917 |
|
| 52 |
+
| No log | 2.0 | 34 | 0.5866 |
|
| 53 |
+
| 0.1407 | 3.0 | 51 | 0.6037 |
|
| 54 |
+
| 0.1407 | 4.0 | 68 | 0.6350 |
|
| 55 |
+
| 0.1407 | 5.0 | 85 | 0.5705 |
|
| 56 |
+
| 0.1413 | 6.0 | 102 | 0.5830 |
|
| 57 |
+
| 0.1413 | 7.0 | 119 | 0.6061 |
|
| 58 |
+
| 0.1413 | 8.0 | 136 | 0.5917 |
|
| 59 |
+
| 0.1359 | 9.0 | 153 | 0.5670 |
|
| 60 |
+
| 0.1359 | 10.0 | 170 | 0.6074 |
|
| 61 |
+
| 0.1359 | 11.0 | 187 | 0.6098 |
|
| 62 |
+
| 0.1317 | 12.0 | 204 | 0.5876 |
|
| 63 |
+
| 0.1317 | 13.0 | 221 | 0.5978 |
|
| 64 |
+
| 0.1317 | 14.0 | 238 | 0.5644 |
|
| 65 |
+
| 0.1254 | 15.0 | 255 | 0.5869 |
|
| 66 |
+
| 0.1254 | 16.0 | 272 | 0.6416 |
|
| 67 |
+
| 0.1254 | 17.0 | 289 | 0.6164 |
|
| 68 |
+
| 0.1218 | 18.0 | 306 | 0.6172 |
|
| 69 |
+
| 0.1218 | 19.0 | 323 | 0.5974 |
|
| 70 |
+
| 0.1218 | 20.0 | 340 | 0.6000 |
|
| 71 |
+
| 0.1264 | 21.0 | 357 | 0.6287 |
|
| 72 |
+
| 0.1264 | 22.0 | 374 | 0.5776 |
|
| 73 |
+
| 0.1264 | 23.0 | 391 | 0.5970 |
|
| 74 |
+
| 0.115 | 24.0 | 408 | 0.5996 |
|
| 75 |
+
| 0.115 | 25.0 | 425 | 0.5608 |
|
| 76 |
+
| 0.115 | 26.0 | 442 | 0.5934 |
|
| 77 |
+
| 0.1169 | 27.0 | 459 | 0.5736 |
|
| 78 |
+
| 0.1169 | 28.0 | 476 | 0.5878 |
|
| 79 |
+
| 0.1169 | 29.0 | 493 | 0.5788 |
|
| 80 |
+
| 0.1044 | 30.0 | 510 | 0.5686 |
|
| 81 |
+
| 0.1044 | 31.0 | 527 | 0.5921 |
|
| 82 |
+
| 0.1044 | 32.0 | 544 | 0.5751 |
|
| 83 |
+
| 0.1017 | 33.0 | 561 | 0.5725 |
|
| 84 |
+
| 0.1017 | 34.0 | 578 | 0.5998 |
|
| 85 |
+
| 0.1017 | 35.0 | 595 | 0.5751 |
|
| 86 |
+
| 0.1012 | 36.0 | 612 | 0.5542 |
|
| 87 |
+
| 0.1012 | 37.0 | 629 | 0.5582 |
|
| 88 |
+
| 0.1012 | 38.0 | 646 | 0.5781 |
|
| 89 |
+
| 0.0985 | 39.0 | 663 | 0.5891 |
|
| 90 |
+
| 0.0985 | 40.0 | 680 | 0.6118 |
|
| 91 |
+
| 0.0985 | 41.0 | 697 | 0.5955 |
|
| 92 |
+
| 0.0889 | 42.0 | 714 | 0.5640 |
|
| 93 |
+
| 0.0889 | 43.0 | 731 | 0.5858 |
|
| 94 |
+
| 0.0889 | 44.0 | 748 | 0.5824 |
|
| 95 |
+
| 0.0829 | 45.0 | 765 | 0.5600 |
|
| 96 |
+
| 0.0829 | 46.0 | 782 | 0.5715 |
|
| 97 |
+
| 0.0829 | 47.0 | 799 | 0.5973 |
|
| 98 |
+
| 0.0836 | 48.0 | 816 | 0.5946 |
|
| 99 |
+
| 0.0836 | 49.0 | 833 | 0.5662 |
|
| 100 |
+
| 0.0808 | 50.0 | 850 | 0.5607 |
|
| 101 |
+
| 0.0808 | 51.0 | 867 | 0.5862 |
|
| 102 |
+
| 0.0808 | 52.0 | 884 | 0.5909 |
|
| 103 |
+
| 0.0804 | 53.0 | 901 | 0.6115 |
|
| 104 |
+
| 0.0804 | 54.0 | 918 | 0.5687 |
|
| 105 |
+
| 0.0804 | 55.0 | 935 | 0.5498 |
|
| 106 |
+
| 0.0716 | 56.0 | 952 | 0.5731 |
|
| 107 |
+
| 0.0716 | 57.0 | 969 | 0.5773 |
|
| 108 |
+
| 0.0716 | 58.0 | 986 | 0.6124 |
|
| 109 |
+
| 0.0708 | 59.0 | 1003 | 0.5770 |
|
| 110 |
+
| 0.0708 | 60.0 | 1020 | 0.5796 |
|
| 111 |
+
| 0.0708 | 61.0 | 1037 | 0.6078 |
|
| 112 |
+
| 0.0691 | 62.0 | 1054 | 0.5943 |
|
| 113 |
+
| 0.0691 | 63.0 | 1071 | 0.5815 |
|
| 114 |
+
| 0.0691 | 64.0 | 1088 | 0.5821 |
|
| 115 |
+
| 0.0632 | 65.0 | 1105 | 0.5937 |
|
| 116 |
+
| 0.0632 | 66.0 | 1122 | 0.5952 |
|
| 117 |
+
| 0.0632 | 67.0 | 1139 | 0.5750 |
|
| 118 |
+
| 0.0651 | 68.0 | 1156 | 0.5858 |
|
| 119 |
+
| 0.0651 | 69.0 | 1173 | 0.5628 |
|
| 120 |
+
| 0.0651 | 70.0 | 1190 | 0.6015 |
|
| 121 |
+
| 0.061 | 71.0 | 1207 | 0.5723 |
|
| 122 |
+
| 0.061 | 72.0 | 1224 | 0.6371 |
|
| 123 |
+
| 0.061 | 73.0 | 1241 | 0.5871 |
|
| 124 |
+
| 0.061 | 74.0 | 1258 | 0.5790 |
|
| 125 |
+
| 0.061 | 75.0 | 1275 | 0.5468 |
|
| 126 |
+
| 0.061 | 76.0 | 1292 | 0.5888 |
|
| 127 |
+
| 0.0587 | 77.0 | 1309 | 0.5894 |
|
| 128 |
+
| 0.0587 | 78.0 | 1326 | 0.5648 |
|
| 129 |
+
| 0.0587 | 79.0 | 1343 | 0.5584 |
|
| 130 |
+
| 0.0508 | 80.0 | 1360 | 0.5719 |
|
| 131 |
+
| 0.0508 | 81.0 | 1377 | 0.5647 |
|
| 132 |
+
| 0.0508 | 82.0 | 1394 | 0.5777 |
|
| 133 |
+
| 0.0564 | 83.0 | 1411 | 0.5679 |
|
| 134 |
+
| 0.0564 | 84.0 | 1428 | 0.5825 |
|
| 135 |
+
| 0.0564 | 85.0 | 1445 | 0.5910 |
|
| 136 |
+
| 0.0515 | 86.0 | 1462 | 0.5750 |
|
| 137 |
+
| 0.0515 | 87.0 | 1479 | 0.5678 |
|
| 138 |
+
| 0.0515 | 88.0 | 1496 | 0.5874 |
|
| 139 |
+
| 0.0506 | 89.0 | 1513 | 0.5406 |
|
| 140 |
+
| 0.0506 | 90.0 | 1530 | 0.5778 |
|
| 141 |
+
| 0.0506 | 91.0 | 1547 | 0.5842 |
|
| 142 |
+
| 0.0458 | 92.0 | 1564 | 0.6066 |
|
| 143 |
+
| 0.0458 | 93.0 | 1581 | 0.5856 |
|
| 144 |
+
| 0.0458 | 94.0 | 1598 | 0.5821 |
|
| 145 |
+
| 0.0488 | 95.0 | 1615 | 0.5833 |
|
| 146 |
+
| 0.0488 | 96.0 | 1632 | 0.5656 |
|
| 147 |
+
| 0.0488 | 97.0 | 1649 | 0.5955 |
|
| 148 |
+
| 0.0466 | 98.0 | 1666 | 0.5563 |
|
| 149 |
+
| 0.0466 | 99.0 | 1683 | 0.5520 |
|
| 150 |
+
| 0.0453 | 100.0 | 1700 | 0.5834 |
|
| 151 |
|
| 152 |
|
| 153 |
### Framework versions
|
model.safetensors
CHANGED
|
@@ -1,3 +1,3 @@
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:
|
| 3 |
size 25909400
|
|
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:a6c584160aca55811f55dffc33bedfe58e2f1f591faa32aa22b112c61f1f881c
|
| 3 |
size 25909400
|