End of training

Browse files

Files changed (4) hide show

README.md +33 -13
model.safetensors +1 -1
runs/Jun03_12-10-09_c4a222934390/events.out.tfevents.1717416611.c4a222934390.167.4 +3 -0
training_args.bin +1 -1

README.md CHANGED Viewed

@@ -1,8 +1,8 @@
 ---
 license: apache-2.0
 tags:
 - generated_from_trainer
-base_model: EleutherAI/pythia-70m
 model-index:
 - name: polish_wikipedia_model
   results: []
@@ -15,7 +15,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [EleutherAI/pythia-70m](https://huggingface.co/EleutherAI/pythia-70m) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.0404
 ## Model description
@@ -40,22 +40,42 @@ The following hyperparameters were used during training:
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
-- num_epochs: 10
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
-| No log        | 1.0   | 133  | 0.4825          |
-| No log        | 2.0   | 266  | 0.4049          |
-| No log        | 3.0   | 399  | 0.4124          |
-| 0.4774        | 4.0   | 532  | 0.2849          |
-| 0.4774        | 5.0   | 665  | 0.2543          |
-| 0.4774        | 6.0   | 798  | 0.1860          |
-| 0.4774        | 7.0   | 931  | 0.1209          |
-| 0.2561        | 8.0   | 1064 | 0.0836          |
-| 0.2561        | 9.0   | 1197 | 0.0530          |
-| 0.2561        | 10.0  | 1330 | 0.0404          |
 ### Framework versions

 ---
 license: apache-2.0
+base_model: EleutherAI/pythia-70m
 tags:
 - generated_from_trainer
 model-index:
 - name: polish_wikipedia_model
   results: []
 This model is a fine-tuned version of [EleutherAI/pythia-70m](https://huggingface.co/EleutherAI/pythia-70m) on the None dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.0319
 ## Model description
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
+- num_epochs: 30
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
+| No log        | 1.0   | 133  | 0.4322          |
+| No log        | 2.0   | 266  | 0.3766          |
+| No log        | 3.0   | 399  | 0.3599          |
+| 0.4669        | 4.0   | 532  | 0.3247          |
+| 0.4669        | 5.0   | 665  | 0.2830          |
+| 0.4669        | 6.0   | 798  | 0.2628          |
+| 0.4669        | 7.0   | 931  | 0.2573          |
+| 0.3481        | 8.0   | 1064 | 0.2443          |
+| 0.3481        | 9.0   | 1197 | 0.1904          |
+| 0.3481        | 10.0  | 1330 | 0.1799          |
+| 0.3481        | 11.0  | 1463 | 0.1475          |
+| 0.2502        | 12.0  | 1596 | 0.1292          |
+| 0.2502        | 13.0  | 1729 | 0.1168          |
+| 0.2502        | 14.0  | 1862 | 0.1103          |
+| 0.2502        | 15.0  | 1995 | 0.0989          |
+| 0.1572        | 16.0  | 2128 | 0.0890          |
+| 0.1572        | 17.0  | 2261 | 0.0736          |
+| 0.1572        | 18.0  | 2394 | 0.0672          |
+| 0.1007        | 19.0  | 2527 | 0.0592          |
+| 0.1007        | 20.0  | 2660 | 0.0550          |
+| 0.1007        | 21.0  | 2793 | 0.0517          |
+| 0.1007        | 22.0  | 2926 | 0.0497          |
+| 0.0674        | 23.0  | 3059 | 0.0458          |
+| 0.0674        | 24.0  | 3192 | 0.0421          |
+| 0.0674        | 25.0  | 3325 | 0.0394          |
+| 0.0674        | 26.0  | 3458 | 0.0378          |
+| 0.0491        | 27.0  | 3591 | 0.0357          |
+| 0.0491        | 28.0  | 3724 | 0.0337          |
+| 0.0491        | 29.0  | 3857 | 0.0323          |
+| 0.0491        | 30.0  | 3990 | 0.0319          |
 ### Framework versions

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:12698b7b59b958f659da901f9e1e4194b27b0643292b80ad3c355f980a049289
 size 281715176

 version https://git-lfs.github.com/spec/v1
+oid sha256:235c4e440f2c71c2e96ff8d219da8cfa791dc929ca1c7df51f76eb7f74a21959
 size 281715176

runs/Jun03_12-10-09_c4a222934390/events.out.tfevents.1717416611.c4a222934390.167.4 ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:4868fad21726fee063717331e91a3d444c19b29bb91bec73ecff2a6c23ca55c0
+size 14885

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:875e2ed0068d7bed07c05f5271ed7f755fe5407a2d7a8921bc6f07bd4488d233
 size 5112

 version https://git-lfs.github.com/spec/v1
+oid sha256:9d2be04d922dfbec40dba786b2af32ba5d7698e6fd00eaf6d77250bc1da61514
 size 5112