vinnyy/codet5-finetuned-42epochs

Browse files

Files changed (4) hide show

README.md +28 -36
config.json +1 -1
model.safetensors +1 -1
training_args.bin +1 -1

README.md CHANGED Viewed

@@ -1,7 +1,7 @@
 ---
 library_name: transformers
 license: apache-2.0
-base_model: vinzur/results
 tags:
 - generated_from_trainer
 model-index:
@@ -14,9 +14,9 @@ should probably proofread and complete it, then remove this comment. -->
 # results
-This model is a fine-tuned version of [vinzur/results](https://huggingface.co/vinzur/results) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.6053
 ## Model description
@@ -35,53 +35,45 @@ More information needed
 ### Training hyperparameters
 The following hyperparameters were used during training:
-- learning_rate: 2e-05
 - train_batch_size: 2
 - eval_batch_size: 2
 - seed: 42
 - optimizer: Use adamw_torch with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
 - lr_scheduler_type: linear
-- num_epochs: 30
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
-| No log        | 1.0   | 62   | 0.9665          |
-| 1.5431        | 2.0   | 124  | 0.8184          |
-| 1.5431        | 3.0   | 186  | 0.7439          |
-| 0.8654        | 4.0   | 248  | 0.7027          |
-| 0.688         | 5.0   | 310  | 0.6795          |
-| 0.688         | 6.0   | 372  | 0.6600          |
-| 0.6288        | 7.0   | 434  | 0.6505          |
-| 0.6288        | 8.0   | 496  | 0.6444          |
-| 0.5713        | 9.0   | 558  | 0.6402          |
-| 0.5478        | 10.0  | 620  | 0.6392          |
-| 0.5478        | 11.0  | 682  | 0.6371          |
-| 0.5366        | 12.0  | 744  | 0.6302          |
-| 0.5056        | 13.0  | 806  | 0.6141          |
-| 0.5056        | 14.0  | 868  | 0.6183          |
-| 0.4948        | 15.0  | 930  | 0.6163          |
-| 0.4948        | 16.0  | 992  | 0.6125          |
-| 0.4468        | 17.0  | 1054 | 0.6136          |
-| 0.4398        | 18.0  | 1116 | 0.6130          |
-| 0.4398        | 19.0  | 1178 | 0.6123          |
-| 0.4284        | 20.0  | 1240 | 0.6128          |
-| 0.4322        | 21.0  | 1302 | 0.6138          |
-| 0.4322        | 22.0  | 1364 | 0.6067          |
-| 0.4203        | 23.0  | 1426 | 0.6108          |
-| 0.4203        | 24.0  | 1488 | 0.6085          |
-| 0.4064        | 25.0  | 1550 | 0.6088          |
-| 0.4085        | 26.0  | 1612 | 0.6108          |
-| 0.4085        | 27.0  | 1674 | 0.6069          |
-| 0.4132        | 28.0  | 1736 | 0.6037          |
-| 0.4132        | 29.0  | 1798 | 0.6067          |
-| 0.3854        | 30.0  | 1860 | 0.6053          |
 ### Framework versions
 - Transformers 4.46.3
 - Pytorch 2.5.1+cu121
-- Datasets 3.1.0
 - Tokenizers 0.20.3

 ---
 library_name: transformers
 license: apache-2.0
+base_model: vinnyy/results
 tags:
 - generated_from_trainer
 model-index:
 # results
+This model is a fine-tuned version of [vinnyy/results](https://huggingface.co/vinnyy/results) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.4479
 ## Model description
 ### Training hyperparameters
 The following hyperparameters were used during training:
+- learning_rate: 1e-05
 - train_batch_size: 2
 - eval_batch_size: 2
 - seed: 42
 - optimizer: Use adamw_torch with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
 - lr_scheduler_type: linear
+- num_epochs: 25
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
+| No log        | 1.0   | 87   | 0.5600          |
+| 0.6043        | 2.0   | 174  | 0.5312          |
+| 0.5608        | 3.0   | 261  | 0.4930          |
+| 0.4626        | 4.0   | 348  | 0.4837          |
+| 0.4311        | 5.0   | 435  | 0.4736          |
+| 0.4456        | 6.0   | 522  | 0.4793          |
+| 0.391         | 7.0   | 609  | 0.4676          |
+| 0.391         | 8.0   | 696  | 0.4674          |
+| 0.383         | 9.0   | 783  | 0.4656          |
+| 0.3735        | 10.0  | 870  | 0.4637          |
+| 0.4062        | 11.0  | 957  | 0.4614          |
+| 0.3528        | 12.0  | 1044 | 0.4588          |
+| 0.3622        | 13.0  | 1131 | 0.4592          |
+| 0.3245        | 14.0  | 1218 | 0.4574          |
+| 0.3267        | 15.0  | 1305 | 0.4564          |
+| 0.3267        | 16.0  | 1392 | 0.4479          |
+| 0.3176        | 17.0  | 1479 | 0.4500          |
+| 0.3127        | 18.0  | 1566 | 0.4499          |
+| 0.3053        | 19.0  | 1653 | 0.4506          |
+| 0.2925        | 20.0  | 1740 | 0.4506          |
+| 0.3064        | 21.0  | 1827 | 0.4498          |
+| 0.2953        | 22.0  | 1914 | 0.4503          |
 ### Framework versions
 - Transformers 4.46.3
 - Pytorch 2.5.1+cu121
+- Datasets 3.2.0
 - Tokenizers 0.20.3

config.json CHANGED Viewed

@@ -1,5 +1,5 @@
 {
-  "_name_or_path": "vinzur/results",
   "architectures": [
     "T5ForConditionalGeneration"
   ],

 {
+  "_name_or_path": "vinnyy/results",
   "architectures": [
     "T5ForConditionalGeneration"
   ],

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:e959db6a7d0a7936f927d2810801d41cdb00b08675d7268e7ae32c377ec48858
 size 891558696

 version https://git-lfs.github.com/spec/v1
+oid sha256:8d4ac4c24da1d5a48e4fbca581dd5f4772ac1ae58f7a3a26c0fd39b30c0e1a34
 size 891558696

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:28a24f4b2c98de876d9ec15ddd7be331cca5a2aee8b5fc1cbf2a4d15545914c2
 size 5240

 version https://git-lfs.github.com/spec/v1
+oid sha256:bc5c8409fd8d48c62aa8892279f6ad90a5898c36e3701a6ec2d0c09028d28854
 size 5240