ccore/gpt2_ACoT

Text Generation · Generated from Trainer · text-generation-inference


ccore committed on Jul 27, 2025

Commit 568dd2c · verified · 1 Parent(s): d9fa30c

End of training

Files changed (2)

README.md +12 -5
model.safetensors +1 -1

README.md CHANGED

@@ -16,7 +16,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [facebook/opt-125m](https://huggingface.co/facebook/opt-125m) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 4.8132
+- Loss: 1.9497
 ## Model description
@@ -43,16 +43,23 @@ The following hyperparameters were used during training:
 - total_train_batch_size: 16
 - optimizer: Use OptimizerNames.ADAMW_TORCH with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
 - lr_scheduler_type: cosine
-- num_epochs: 3
+- num_epochs: 10
 - mixed_precision_training: Native AMP
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
-| No log        | 1.0   | 6    | 5.9545          |
-| No log        | 2.0   | 12   | 4.9997          |
-| No log        | 3.0   | 18   | 4.8132          |
+| 21.1265       | 1.0   | 276  | 2.5543          |
+| 18.2347       | 2.0   | 552  | 2.2861          |
+| 16.8533       | 3.0   | 828  | 2.1435          |
+| 15.6224       | 4.0   | 1104 | 2.0605          |
+| 14.7096       | 5.0   | 1380 | 2.0051          |
+| 14.1231       | 6.0   | 1656 | 1.9733          |
+| 13.5673       | 7.0   | 1932 | 1.9566          |
+| 13.1727       | 8.0   | 2208 | 1.9493          |
+| 13.0597       | 9.0   | 2484 | 1.9492          |
+| 12.9195       | 10.0  | 2760 | 1.9497          |
 ### Framework versions
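
The validation losses added in this README change are easier to compare as perplexities. The sketch below is not part of the commit. It assumes the reported loss is a mean per-token cross-entropy in nats (the card does not state this), and the names `val_loss` and `perplexity` are illustrative. The numbers are copied from the added table rows.

```python
import math

# (epoch, validation loss) pairs copied from the added rows of the
# "Training results" table in this commit.
val_loss = [
    (1, 2.5543), (2, 2.2861), (3, 2.1435), (4, 2.0605), (5, 2.0051),
    (6, 1.9733), (7, 1.9566), (8, 1.9493), (9, 1.9492), (10, 1.9497),
]


def perplexity(loss: float) -> float:
    """Convert a cross-entropy loss (assumed to be in nats) to perplexity."""
    return math.exp(loss)


# Epoch with the lowest validation loss.
best_epoch, best_loss = min(val_loss, key=lambda row: row[1])

for epoch, loss in val_loss:
    print(f"epoch {epoch:2d}  val_loss {loss:.4f}  ppl {perplexity(loss):.2f}")

print(f"lowest validation loss: epoch {best_epoch} ({best_loss:.4f})")
```

Under that assumption the final perplexity is roughly 7. The lowest validation loss in the table is at epoch 9 (1.9492), while the card reports the epoch-10 value (1.9497). The two differ by 0.0005, so the curve has effectively flattened over the last three epochs.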

model.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:400714933e436ed4942f6ac14653ba65257ac6b5a7d4ce477d416ba55e17474e
+oid sha256:18aa3793eafce3749c6627f617fc9beefd20e5e39a72c4be54bc7466a8da3a58
 size 504109968
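
The `model.safetensors` entry is a Git LFS pointer, not the weights themselves: only the `oid` changed, and the size stayed at 504109968 bytes. A downloaded file can be checked against such a pointer roughly as follows. This is a minimal sketch, not part of the commit. The helper names `parse_lfs_pointer` and `file_matches_pointer` are hypothetical, and the local path in the usage note is an assumption.

```python
import hashlib
from pathlib import Path

# New pointer contents from this commit (diff markers removed).
POINTER = """\
version https://git-lfs.github.com/spec/v1
oid sha256:18aa3793eafce3749c6627f617fc9beefd20e5e39a72c4be54bc7466a8da3a58
size 504109968
"""


def parse_lfs_pointer(text: str) -> dict:
    """Parse the 'key value' lines of a Git LFS pointer file."""
    fields = dict(line.split(" ", 1) for line in text.splitlines() if line.strip())
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def file_matches_pointer(path, pointer: dict) -> bool:
    """Return True if the file's size and hash match the pointer."""
    path = Path(path)
    if path.stat().st_size != pointer["size"]:
        return False  # cheap check before hashing ~500 MB
    hasher = hashlib.new(pointer["algo"])
    with path.open("rb") as handle:
        # Hash in 1 MiB chunks so the whole file is never held in memory.
        for chunk in iter(lambda: handle.read(1 << 20), b""):
            hasher.update(chunk)
    return hasher.hexdigest() == pointer["digest"]


pointer = parse_lfs_pointer(POINTER)
print(pointer["algo"], pointer["size"])
```

With the weights downloaded to a local file (for example `model.safetensors` in the working directory, an assumed location), `file_matches_pointer("model.safetensors", pointer)` would report whether the file corresponds to this commit and not to the parent.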