li-jay-cs
/

gptj-supervised-summarize-checkpoint

@@ -17,11 +17,11 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [li-jay-cs/gptj-supervised-summarize-checkpoint](https://huggingface.co/li-jay-cs/gptj-supervised-summarize-checkpoint) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 1.9180
-- Rouge1: 0.5817
-- Rouge2: 0.1815
-- Rougel: 0.3832
-- Rougelsum: 0.5069
 ## Model description
@@ -48,17 +48,17 @@ The following hyperparameters were used during training:
 - optimizer: Adam with betas=(0.9,0.95) and epsilon=1e-08
 - lr_scheduler_type: linear
 - lr_scheduler_warmup_steps: 100
-- num_epochs: 1
 - mixed_precision_training: Native AMP
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum |
 |:-------------:|:-----:|:----:|:---------------:|:------:|:------:|:------:|:---------:|
-| 2.0512        | 0.21  | 500  | 1.9746          | 0.5794 | 0.1755 | 0.3743 | 0.5028    |
-| 2.0334        | 0.43  | 1000 | 1.9463          | 0.5795 | 0.1779 | 0.3784 | 0.5032    |
-| 1.9925        | 0.64  | 1500 | 1.9287          | 0.5831 | 0.1810 | 0.3812 | 0.5069    |
-| 1.9887        | 0.86  | 2000 | 1.9180          | 0.5817 | 0.1815 | 0.3832 | 0.5069    |
 ### Framework versions

 This model is a fine-tuned version of [li-jay-cs/gptj-supervised-summarize-checkpoint](https://huggingface.co/li-jay-cs/gptj-supervised-summarize-checkpoint) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 1.8506
+- Rouge1: 0.5938
+- Rouge2: 0.1912
+- Rougel: 0.3937
+- Rougelsum: 0.5184
 ## Model description
 - optimizer: Adam with betas=(0.9,0.95) and epsilon=1e-08
 - lr_scheduler_type: linear
 - lr_scheduler_warmup_steps: 100
+- num_epochs: 2
 - mixed_precision_training: Native AMP
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum |
 |:-------------:|:-----:|:----:|:---------------:|:------:|:------:|:------:|:---------:|
+| 1.904         | 0.43  | 1000 | 1.8633          | 0.5912 | 0.1888 | 0.3913 | 0.5149    |
+| 1.8931        | 0.86  | 2000 | 1.8584          | 0.5907 | 0.1890 | 0.3920 | 0.5153    |
+| 1.8758        | 1.28  | 3000 | 1.8545          | 0.5929 | 0.1906 | 0.3929 | 0.5168    |
+| 1.8699        | 1.71  | 4000 | 1.8506          | 0.5938 | 0.1912 | 0.3937 | 0.5184    |
 ### Framework versions

pytorch_model.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:486a9d5b5946068a1b5cbb803215779614f9ebe2c470104dbcc1cd587942ba5d
 size 326120926

 version https://git-lfs.github.com/spec/v1
+oid sha256:fa09caf64af0a9d4b7048c60be8edb1bfb1c8e2ff6e23208ca424c7e00993f3a
 size 326120926

runs/Nov14_08-11-28_ed58f05e541c/events.out.tfevents.1699949499.ed58f05e541c.10087.0 CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:f1db148953b6eee295f67bf3bc8f57b756fd040d526c020cd665cd106a189690
-size 19118

 version https://git-lfs.github.com/spec/v1
+oid sha256:7159197041453179d71ebda7cbbc58d8745015eb4cd29a9bb1c5b974a06a0a01
+size 21513