apwic
/

summarization-base-0

Generated from Trainer

text-generation-inference

Model card Files Files and versions

Metrics Training metrics Community

apwic commited on Jul 3, 2024

Commit

e7ee6e3

·

verified ·

1 Parent(s): c3f1303

Model save

Files changed (1) hide show

README.md +13 -13

README.md CHANGED Viewed

@@ -17,11 +17,11 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [LazarusNLP/IndoNanoT5-base](https://huggingface.co/LazarusNLP/IndoNanoT5-base) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.7208
-- Rouge1: 0.6602
 - Rouge2: 0.0
-- Rougel: 0.6558
-- Rougelsum: 0.6584
 - Gen Len: 1.0
 ## Model description
@@ -42,8 +42,8 @@ More information needed
 The following hyperparameters were used during training:
 - learning_rate: 0.001
-- train_batch_size: 8
-- eval_batch_size: 16
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
@@ -53,16 +53,16 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum | Gen Len |
 |:-------------:|:-----:|:----:|:---------------:|:------:|:------:|:------:|:---------:|:-------:|
-| 1.3499        | 1.0   | 1783 | 0.8425          | 0.6751 | 0.0    | 0.6714 | 0.6724    | 1.0     |
-| 0.7508        | 2.0   | 3566 | 0.7148          | 0.7129 | 0.0    | 0.7106 | 0.7091    | 1.0     |
-| 0.5557        | 3.0   | 5349 | 0.6591          | 0.6716 | 0.0    | 0.6666 | 0.6669    | 1.0     |
-| 0.4087        | 4.0   | 7132 | 0.6609          | 0.7079 | 0.0    | 0.7053 | 0.7064    | 1.0     |
-| 0.2641        | 5.0   | 8915 | 0.7208          | 0.6602 | 0.0    | 0.6558 | 0.6584    | 1.0     |
 ### Framework versions
 - Transformers 4.40.2
-- Pytorch 2.3.0+cu121
-- Datasets 2.19.1
 - Tokenizers 0.19.1

 This model is a fine-tuned version of [LazarusNLP/IndoNanoT5-base](https://huggingface.co/LazarusNLP/IndoNanoT5-base) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.7147
+- Rouge1: 0.677
 - Rouge2: 0.0
+- Rougel: 0.6766
+- Rougelsum: 0.6756
 - Gen Len: 1.0
 ## Model description
 The following hyperparameters were used during training:
 - learning_rate: 0.001
+- train_batch_size: 16
+- eval_batch_size: 64
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
 | Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum | Gen Len |
 |:-------------:|:-----:|:----:|:---------------:|:------:|:------:|:------:|:---------:|:-------:|
+| 1.2189        | 1.0   | 892  | 0.7782          | 0.6456 | 0.0    | 0.6409 | 0.6449    | 1.0     |
+| 0.6795        | 2.0   | 1784 | 0.6560          | 0.6574 | 0.0    | 0.6553 | 0.6569    | 1.0     |
+| 0.4861        | 3.0   | 2676 | 0.6245          | 0.6717 | 0.0    | 0.6667 | 0.6691    | 1.0     |
+| 0.3405        | 4.0   | 3568 | 0.6443          | 0.6974 | 0.0    | 0.6969 | 0.6948    | 1.0     |
+| 0.2041        | 5.0   | 4460 | 0.7147          | 0.677  | 0.0    | 0.6766 | 0.6756    | 1.0     |
 ### Framework versions
 - Transformers 4.40.2
+- Pytorch 2.3.1+cu121
+- Datasets 2.20.0
 - Tokenizers 0.19.1