End of training

Browse files

Files changed (5) hide show

README.md +84 -0
generation_config.json +12 -0
model.safetensors +1 -1
runs/Jun18_12-16-47_iit-p/events.out.tfevents.1718693214.iit-p.41862.0 +2 -2
runs/Jun18_12-16-47_iit-p/events.out.tfevents.1718693926.iit-p.41862.1 +3 -0

README.md ADDED Viewed

	@@ -0,0 +1,84 @@

+---
+license: apache-2.0
+base_model: facebook/bart-base
+tags:
+- generated_from_trainer
+metrics:
+- rouge
+model-index:
+- name: bart-base-summarize-finetuned
+  results: []
+---
+<!-- This model card has been generated automatically according to the information the Trainer had access to. You
+should probably proofread and complete it, then remove this comment. -->
+# bart-base-summarize-finetuned
+This model is a fine-tuned version of [facebook/bart-base](https://huggingface.co/facebook/bart-base) on the None dataset.
+It achieves the following results on the evaluation set:
+- Loss: 0.3408
+- Rouge1: 79.6622
+- Rouge2: 77.9282
+- Rougel: 79.6654
+- Rougelsum: 79.6384
+- Gen Len: 7.8821
+## Model description
+More information needed
+## Intended uses & limitations
+More information needed
+## Training and evaluation data
+More information needed
+## Training procedure
+### Training hyperparameters
+The following hyperparameters were used during training:
+- learning_rate: 2e-05
+- train_batch_size: 64
+- eval_batch_size: 64
+- seed: 42
+- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
+- lr_scheduler_type: linear
+- num_epochs: 20
+- mixed_precision_training: Native AMP
+### Training results
+| Training Loss | Epoch | Step | Validation Loss | Rouge1  | Rouge2  | Rougel  | Rougelsum | Gen Len |
+|:-------------:|:-----:|:----:|:---------------:|:-------:|:-------:|:-------:|:---------:|:-------:|
+| No log        | 1.0   | 62   | 0.3856          | 67.6564 | 65.4045 | 67.6202 | 67.6206   | 6.6825  |
+| No log        | 2.0   | 124  | 0.3529          | 70.23   | 68.4349 | 70.2289 | 70.1265   | 6.5756  |
+| No log        | 3.0   | 186  | 0.3303          | 75.4875 | 73.3149 | 75.3918 | 75.3835   | 7.9808  |
+| No log        | 4.0   | 248  | 0.3165          | 76.17   | 74.0354 | 76.2341 | 76.1363   | 7.4435  |
+| No log        | 5.0   | 310  | 0.3094          | 76.9425 | 75.0561 | 76.9582 | 76.8794   | 7.9567  |
+| No log        | 6.0   | 372  | 0.3130          | 78.1808 | 76.2533 | 78.1846 | 78.1377   | 7.9062  |
+| No log        | 7.0   | 434  | 0.3081          | 78.5859 | 76.7258 | 78.6782 | 78.5825   | 7.6946  |
+| No log        | 8.0   | 496  | 0.3195          | 78.8452 | 76.85   | 78.8076 | 78.7562   | 8.1663  |
+| 0.3758        | 9.0   | 558  | 0.3103          | 78.9204 | 77.2131 | 78.9671 | 78.9562   | 8.1341  |
+| 0.3758        | 10.0  | 620  | 0.3091          | 78.7793 | 76.8877 | 78.7503 | 78.7031   | 7.7319  |
+| 0.3758        | 11.0  | 682  | 0.3173          | 79.1693 | 77.4324 | 79.2141 | 79.1671   | 7.8881  |
+| 0.3758        | 12.0  | 744  | 0.3192          | 79.3653 | 77.6962 | 79.4379 | 79.3547   | 7.7339  |
+| 0.3758        | 13.0  | 806  | 0.3246          | 79.041  | 77.1587 | 79.1201 | 79.0828   | 7.8438  |
+| 0.3758        | 14.0  | 868  | 0.3312          | 79.4605 | 77.7629 | 79.5227 | 79.4425   | 7.8014  |
+| 0.3758        | 15.0  | 930  | 0.3300          | 79.7724 | 78.167  | 79.8187 | 79.799    | 7.8609  |
+| 0.3758        | 16.0  | 992  | 0.3409          | 79.4618 | 77.694  | 79.4758 | 79.4325   | 7.8296  |
+| 0.14          | 17.0  | 1054 | 0.3436          | 79.1169 | 77.3095 | 79.1082 | 79.092    | 8.0302  |
+| 0.14          | 18.0  | 1116 | 0.3440          | 78.9896 | 77.2319 | 78.984  | 78.9472   | 7.9325  |
+| 0.14          | 19.0  | 1178 | 0.3399          | 79.531  | 77.8083 | 79.5489 | 79.5005   | 7.871   |
+| 0.14          | 20.0  | 1240 | 0.3408          | 79.6622 | 77.9282 | 79.6654 | 79.6384   | 7.8821  |
+### Framework versions
+- Transformers 4.41.1
+- Pytorch 1.13.1+cu117
+- Datasets 2.19.1
+- Tokenizers 0.19.1

generation_config.json ADDED Viewed

	@@ -0,0 +1,12 @@

+{
+  "bos_token_id": 0,
+  "decoder_start_token_id": 2,
+  "early_stopping": true,
+  "eos_token_id": 2,
+  "forced_bos_token_id": 0,
+  "forced_eos_token_id": 2,
+  "no_repeat_ngram_size": 3,
+  "num_beams": 4,
+  "pad_token_id": 1,
+  "transformers_version": "4.41.1"
+}

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:2de0fdadf6eab3b01ef7fddfd0718b420acfe19a9f1485519fd080dc9d91d23c
 size 557912620

 version https://git-lfs.github.com/spec/v1
+oid sha256:15c25c37c9805bb429a9763c32c5f9b502e119a20d5f497b1334f8e17c344458
 size 557912620

runs/Jun18_12-16-47_iit-p/events.out.tfevents.1718693214.iit-p.41862.0 CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:76779589fc46720b9bdde5cc50f13fa764ce67fb3b5553d17955485079cfbf60
-size 14820

 version https://git-lfs.github.com/spec/v1
+oid sha256:3e26f0ffae27045532a8bf9499a2ab97d0a7886a71b725972079ae9ca2424e73
+size 17274

runs/Jun18_12-16-47_iit-p/events.out.tfevents.1718693926.iit-p.41862.1 ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:35cf835d423a4befb731424b36ab724749d997a127e48b3fad1493d43b51ed2d
+size 565