paulh27/xsum_unaligned_smallT5

paulh27 committed on Apr 16, 2024

Commit c1b140d · verified · 1 Parent(s): 264b1d7

Training complete

Files changed (2)

README.md +2 -17
generation_config.json +1 -0

README.md CHANGED

@@ -4,8 +4,6 @@ base_model: google-t5/t5-small
 tags:
 - summarization
 - generated_from_trainer
-metrics:
-- rouge
 model-index:
 - name: xsum_unaligned_smallT5
   results: []
@@ -17,12 +15,6 @@ should probably proofread and complete it, then remove this comment. -->
 # xsum_unaligned_smallT5
 This model is a fine-tuned version of [google-t5/t5-small](https://huggingface.co/google-t5/t5-small) on an unknown dataset.
-It achieves the following results on the evaluation set:
-- Loss: 2.8386
-- Rouge1: 0.2219
-- Rouge2: 0.0465
-- Rougel: 0.1675
-- Rougelsum: 0.1714
 ## Model description
@@ -49,18 +41,11 @@ The following hyperparameters were used during training:
 - total_train_batch_size: 16
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
-- training_steps: 20
+- training_steps: 200000
+- mixed_precision_training: Native AMP
 ### Training results
-| Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum |
-|:-------------:|:-----:|:----:|:---------------:|:------:|:------:|:------:|:---------:|
-| No log        | 0.86  | 3    | 3.3139          | 0.2048 | 0.0374 | 0.1476 | 0.1663    |
-| 3.6766        | 2.0   | 7    | 3.0257          | 0.2078 | 0.0347 | 0.1515 | 0.1644    |
-| 3.6766        | 2.86  | 10   | 2.9150          | 0.2208 | 0.0427 | 0.1660 | 0.1761    |
-| 3.0126        | 4.0   | 14   | 2.8600          | 0.2229 | 0.0449 | 0.1704 | 0.1782    |
-| 3.0126        | 4.86  | 17   | 2.8455          | 0.2184 | 0.0455 | 0.1649 | 0.1689    |
-| 2.8062        | 5.71  | 20   | 2.8386          | 0.2219 | 0.0465 | 0.1675 | 0.1714    |
 ### Framework versions
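
To illustrate what changing `training_steps` from 20 to 200000 means under `lr_scheduler_type: linear`, here is a minimal standard-library sketch of a linear decay schedule. It is not taken from the repository. The base learning rate is not visible in this diff, so `BASE_LR` is a hypothetical placeholder, and zero warmup steps is an assumption.

```python
def linear_lr(step, total_steps, base_lr, warmup_steps=0):
    """Linear warmup (optional) followed by linear decay to zero."""
    if warmup_steps > 0 and step < warmup_steps:
        # Ramp up linearly from 0 to base_lr during warmup.
        return base_lr * step / warmup_steps
    # Decay linearly so the rate reaches 0 at total_steps.
    remaining = max(0, total_steps - step)
    return base_lr * remaining / max(1, total_steps - warmup_steps)


BASE_LR = 1e-4  # hypothetical value; the real learning rate is not shown in this diff

for total in (20, 200000):
    halfway = linear_lr(total // 2, total, BASE_LR)
    final = linear_lr(total, total, BASE_LR)
    print(f"training_steps={total}: lr at halfway={halfway:.2e}, lr at end={final:.2e}")
```

With a linear schedule the shape of the decay is the same for both runs. Only the number of optimizer steps over which it is stretched differs.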

generation_config.json CHANGED

@@ -1,4 +1,5 @@
 {
+  "_from_model_config": true,
   "decoder_start_token_id": 0,
   "eos_token_id": 1,
   "pad_token_id": 0,
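
As a rough illustration of the fields touched by this hunk, the sketch below parses only the keys visible above using the standard library. The rest of the file is not shown in this diff and is left out rather than guessed. `summarize` is a hypothetical helper name. Reading `_from_model_config` as a marker that the generation config was derived from the model config is an assumption about how the `transformers` library uses the flag.

```python
import json

# Only the keys visible in the hunk; the remainder of the file is not shown here.
visible_fragment = """
{
  "_from_model_config": true,
  "decoder_start_token_id": 0,
  "eos_token_id": 1,
  "pad_token_id": 0
}
"""

config = json.loads(visible_fragment)


def summarize(cfg):
    """Report a few relationships between the token ids in the fragment."""
    return {
        # Assumed meaning: config was derived from the model config.
        "derived_from_model_config": bool(cfg.get("_from_model_config", False)),
        # Decoding starts from the same id that is used for padding.
        "decoder_starts_with_pad": cfg["decoder_start_token_id"] == cfg["pad_token_id"],
        "eos_differs_from_pad": cfg["eos_token_id"] != cfg["pad_token_id"],
    }


print(summarize(config))
```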