End of training

Browse files

Files changed (3) hide show

README.md +83 -0
generation_config.json +6 -0
pytorch_model.bin +1 -1

README.md ADDED Viewed

	@@ -0,0 +1,83 @@

+---
+license: apache-2.0
+base_model: t5-small
+tags:
+- generated_from_trainer
+metrics:
+- rouge
+model-index:
+- name: philosophy_model
+  results: []
+---
+<!-- This model card has been generated automatically according to the information the Trainer had access to. You
+should probably proofread and complete it, then remove this comment. -->
+# philosophy_model
+This model is a fine-tuned version of [t5-small](https://huggingface.co/t5-small) on an unknown dataset.
+It achieves the following results on the evaluation set:
+- Loss: 0.0004
+- Rouge1: 0.8007
+- Rouge2: 0.7918
+- Rougel: 0.8011
+- Rougelsum: 0.8009
+- Gen Len: 18.4688
+## Model description
+More information needed
+## Intended uses & limitations
+More information needed
+## Training and evaluation data
+More information needed
+## Training procedure
+### Training hyperparameters
+The following hyperparameters were used during training:
+- learning_rate: 0.0056
+- train_batch_size: 4
+- eval_batch_size: 4
+- seed: 42
+- optimizer: Adam with betas=(0.9,0.99) and epsilon=1e-06
+- lr_scheduler_type: linear
+- num_epochs: 20
+### Training results
+| Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum | Gen Len |
+|:-------------:|:-----:|:----:|:---------------:|:------:|:------:|:------:|:---------:|:-------:|
+| No log        | 1.0   | 31   | 2.0791          | 0.363  | 0.1735 | 0.3218 | 0.3229    | 17.4688 |
+| No log        | 2.0   | 62   | 1.3291          | 0.4134 | 0.2484 | 0.3829 | 0.3833    | 18.625  |
+| No log        | 3.0   | 93   | 0.9179          | 0.4946 | 0.3788 | 0.4721 | 0.4708    | 18.625  |
+| No log        | 4.0   | 124  | 0.7255          | 0.5559 | 0.4178 | 0.5355 | 0.5351    | 18.375  |
+| No log        | 5.0   | 155  | 0.4343          | 0.7166 | 0.6528 | 0.7012 | 0.7008    | 18.5938 |
+| No log        | 6.0   | 186  | 0.2940          | 0.7177 | 0.6657 | 0.7018 | 0.7038    | 18.4062 |
+| No log        | 7.0   | 217  | 0.1758          | 0.7609 | 0.7276 | 0.7547 | 0.7541    | 18.125  |
+| No log        | 8.0   | 248  | 0.1660          | 0.737  | 0.702  | 0.7307 | 0.7306    | 18.25   |
+| No log        | 9.0   | 279  | 0.0964          | 0.777  | 0.7549 | 0.7741 | 0.775     | 18.3125 |
+| No log        | 10.0  | 310  | 0.1104          | 0.7848 | 0.7677 | 0.7827 | 0.7821    | 18.4688 |
+| No log        | 11.0  | 341  | 0.0553          | 0.7912 | 0.7801 | 0.7926 | 0.7916    | 18.375  |
+| No log        | 12.0  | 372  | 0.0408          | 0.795  | 0.782  | 0.7952 | 0.7942    | 18.4688 |
+| No log        | 13.0  | 403  | 0.0537          | 0.794  | 0.7822 | 0.7921 | 0.7911    | 18.4688 |
+| No log        | 14.0  | 434  | 0.0313          | 0.8054 | 0.7909 | 0.8047 | 0.8041    | 18.4688 |
+| No log        | 15.0  | 465  | 0.0302          | 0.8023 | 0.7867 | 0.8031 | 0.803     | 18.4688 |
+| No log        | 16.0  | 496  | 0.0167          | 0.8017 | 0.7918 | 0.8026 | 0.802     | 18.4688 |
+| 0.8485        | 17.0  | 527  | 0.0186          | 0.7979 | 0.7895 | 0.7989 | 0.7982    | 18.375  |
+| 0.8485        | 18.0  | 558  | 0.0005          | 0.8007 | 0.7918 | 0.8011 | 0.8009    | 18.4688 |
+| 0.8485        | 19.0  | 589  | 0.0006          | 0.8007 | 0.7918 | 0.8011 | 0.8009    | 18.4688 |
+| 0.8485        | 20.0  | 620  | 0.0004          | 0.8007 | 0.7918 | 0.8011 | 0.8009    | 18.4688 |
+### Framework versions
+- Transformers 4.33.2
+- Pytorch 2.0.1+cu118
+- Datasets 2.14.5
+- Tokenizers 0.13.3

generation_config.json ADDED Viewed

	@@ -0,0 +1,6 @@

+{
+  "decoder_start_token_id": 0,
+  "eos_token_id": 1,
+  "pad_token_id": 0,
+  "transformers_version": "4.33.2"
+}

pytorch_model.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:6e86bbaadddd62327137648c8b2940140a1e65a497546f0eb732670398ceb0e4
 size 242071641

 version https://git-lfs.github.com/spec/v1
+oid sha256:b64b361c8b6b97609918565efb0741a6f30e9702a6940c6822d8c0abfeb812dd
 size 242071641