David-Xu/t5-small_arxiv_model

@@ -22,7 +22,7 @@ model-index:
     metrics:
     - name: Rouge1
       type: rouge
-      value: 0.1788
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -32,11 +32,11 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [google-t5/t5-small](https://huggingface.co/google-t5/t5-small) on the scientific_papers dataset.
 It achieves the following results on the evaluation set:
-- Loss: 2.5959
-- Rouge1: 0.1788
-- Rouge2: 0.0689
-- Rougel: 0.1435
-- Rougelsum: 0.1434
 - Gen Len: 19.0
 ## Model description
@@ -62,14 +62,16 @@ The following hyperparameters were used during training:
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
-- num_epochs: 1
 - mixed_precision_training: Native AMP
 ### Training results
 | Training Loss | Epoch | Step  | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum | Gen Len |
 |:-------------:|:-----:|:-----:|:---------------:|:------:|:------:|:------:|:---------:|:-------:|
-| 2.8131        | 1.0   | 20303 | 2.5959          | 0.1788 | 0.0689 | 0.1435 | 0.1434    | 19.0    |
 ### Framework versions

     metrics:
     - name: Rouge1
       type: rouge
+      value: 0.1782
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 This model is a fine-tuned version of [google-t5/t5-small](https://huggingface.co/google-t5/t5-small) on the scientific_papers dataset.
 It achieves the following results on the evaluation set:
+- Loss: 2.5070
+- Rouge1: 0.1782
+- Rouge2: 0.0681
+- Rougel: 0.1422
+- Rougelsum: 0.1423
 - Gen Len: 19.0
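
To make the Rouge1 figure above easier to interpret, here is a minimal sketch of a unigram-overlap F1 score. It is a simplified illustration only: it assumes lowercase whitespace tokenization with no stemming, so it is not the implementation the Trainer used and will not reproduce the reported numbers. The function name `rouge1_f1` is hypothetical.

```python
from collections import Counter


def rouge1_f1(reference: str, candidate: str) -> float:
    """Simplified ROUGE-1 F1: unigram overlap between two strings.

    Assumption: lowercase whitespace tokenization, no stemming.
    """
    ref_counts = Counter(reference.lower().split())
    cand_counts = Counter(candidate.lower().split())

    # Clipped overlap: each word counts at most as often as it
    # appears in both the reference and the candidate.
    overlap = sum((ref_counts & cand_counts).values())
    if overlap == 0:
        return 0.0

    precision = overlap / sum(cand_counts.values())
    recall = overlap / sum(ref_counts.values())
    return 2 * precision * recall / (precision + recall)


# A short candidate scored against a longer reference: precision is
# perfect, but recall is low, which pulls the F1 score down.
score = rouge1_f1("the cat sat on the mat", "the cat")
print(f"{score:.2f}")
```

With a Gen Len of 19.0, the generated summaries are short relative to typical abstracts, so low recall of this kind is one plausible contributor to the modest ROUGE scores.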
 ## Model description
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
+- num_epochs: 3
 - mixed_precision_training: Native AMP
 ### Training results
 | Training Loss | Epoch | Step  | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum | Gen Len |
 |:-------------:|:-----:|:-----:|:---------------:|:------:|:------:|:------:|:---------:|:-------:|
+| 2.7744        | 1.0   | 20303 | 2.5639          | 0.1793 | 0.0691 | 0.1438 | 0.1439    | 19.0    |
+| 2.6041        | 2.0   | 40606 | 2.5171          | 0.1778 | 0.0677 | 0.142  | 0.142     | 19.0    |
+| 2.5843        | 3.0   | 60909 | 2.5070          | 0.1782 | 0.0681 | 0.1422 | 0.1423    | 19.0    |
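
The rows above can be compared programmatically. The sketch below copies the three epoch rows from the table and reports which epoch is best by validation loss and which by Rouge1; the tuple layout and variable names are illustrative assumptions, not part of the training code.

```python
# Rows copied from the "Training results" table:
# (epoch, step, training_loss, validation_loss, rouge1)
rows = [
    (1.0, 20303, 2.7744, 2.5639, 0.1793),
    (2.0, 40606, 2.6041, 2.5171, 0.1778),
    (3.0, 60909, 2.5843, 2.5070, 0.1782),
]

# Lower validation loss is better; higher Rouge1 is better.
best_by_loss = min(rows, key=lambda r: r[3])
best_by_rouge1 = max(rows, key=lambda r: r[4])

# Optimizer steps per epoch, derived from the first row.
steps_per_epoch = rows[0][1] / rows[0][0]

print(f"Lowest validation loss: epoch {best_by_loss[0]} ({best_by_loss[3]})")
print(f"Highest Rouge1: epoch {best_by_rouge1[0]} ({best_by_rouge1[4]})")
print(f"Steps per epoch: {steps_per_epoch:.0f}")
```

On these numbers the two criteria disagree: validation loss keeps falling through epoch 3, while Rouge1 peaks at epoch 1 and changes only in the fourth decimal place afterwards.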
 ### Framework versions

model.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:ea8cd49718341317641f5712ae43fe4f509384b25d28666bf161881fcd934dfb
 size 242041896

 version https://git-lfs.github.com/spec/v1
+oid sha256:0561177eb7a2e91fd7d4cf12b1d0c69f1b3dd9da93438f5f7992d1967095f546
 size 242041896

runs/Feb28_03-00-41_891014fe4ff4/events.out.tfevents.1709089242.891014fe4ff4.2709.0 CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:83a2ccb28dca233f305e7c521dfd483cdf563dd1ec994057e53a1c52a2d088d0
-size 25687

 version https://git-lfs.github.com/spec/v1
+oid sha256:8fc5940f7baed35e5ae923cc24a05437053cd22bc773790cda5c9e264fe9af24
+size 26582