lengocquangLAB/T5-JSON-OM-IMP

text2text-generation

Generated from Trainer

text-generation-inference


lengocquangLAB committed on Apr 10, 2025

Commit ce573a6 · verified · 1 Parent(s): d724694

End of training

Files changed (2)

README.md +7 -6
model.safetensors +1 -1

README.md CHANGED

@@ -19,15 +19,15 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [T5-small](https://huggingface.co/T5-small) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 10.2871
 - Micro Precision: 0
 - Micro Recall: 0.0
 - Micro F1: 0
 - Macro Precision: 0.0
 - Macro Recall: 0.0
 - Macro F1: 0
-- Bleu: 0.0236
-- Rouge1: 0.0145
 - Rouge2: 0.0
 ## Model description
@@ -48,8 +48,8 @@ More information needed
 The following hyperparameters were used during training:
 - learning_rate: 2e-05
-- train_batch_size: 8
-- eval_batch_size: 8
 - seed: 42
 - optimizer: Use adamw_torch with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
 - lr_scheduler_type: linear
@@ -60,7 +60,8 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch  | Step | Validation Loss | Micro Precision | Micro Recall | Micro F1 | Macro Precision | Macro Recall | Macro F1 | Bleu   | Rouge1 | Rouge2 |
 |:-------------:|:------:|:----:|:---------------:|:---------------:|:------------:|:--------:|:---------------:|:------------:|:--------:|:------:|:------:|:------:|
-| 15.4491       | 7.1429 | 50   | 10.2871         | 0               | 0.0          | 0        | 0.0             | 0.0          | 0        | 0.0236 | 0.0145 | 0.0    |
 ### Framework versions

 This model is a fine-tuned version of [T5-small](https://huggingface.co/T5-small) on the None dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.9927
 - Micro Precision: 0
 - Micro Recall: 0.0
 - Micro F1: 0
 - Macro Precision: 0.0
 - Macro Recall: 0.0
 - Macro F1: 0
+- Bleu: 0.0243
+- Rouge1: 0.0133
 - Rouge2: 0.0
 ## Model description
 The following hyperparameters were used during training:
 - learning_rate: 2e-05
+- train_batch_size: 4
+- eval_batch_size: 4
 - seed: 42
 - optimizer: Use adamw_torch with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
 - lr_scheduler_type: linear
 | Training Loss | Epoch  | Step | Validation Loss | Micro Precision | Micro Recall | Micro F1 | Macro Precision | Macro Recall | Macro F1 | Bleu   | Rouge1 | Rouge2 |
 |:-------------:|:------:|:----:|:---------------:|:---------------:|:------------:|:--------:|:---------------:|:------------:|:--------:|:------:|:------:|:------:|
+| 15.1542       | 3.8462 | 50   | 8.6615          | 0               | 0.0          | 0        | 0.0             | 0.0          | 0        | 0.0228 | 0.0126 | 0.0    |
+| 4.6708        | 7.6923 | 100  | 0.9927          | 0               | 0.0          | 0        | 0.0             | 0.0          | 0        | 0.0243 | 0.0133 | 0.0    |
 ### Framework versions
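
The logged `Epoch` and `Step` columns, together with the batch-size change from 8 to 4, allow a rough consistency check on the size of the training set. The sketch below is only an inference from the numbers in this diff: it assumes a single device, no gradient accumulation, and that the last partial batch is kept, none of which is visible in the changed lines. The helper names `steps_per_epoch` and `dataset_size_range` are hypothetical.

```python
def steps_per_epoch(step: int, epoch: float) -> int:
    """Infer optimizer steps per epoch from one logged (step, epoch) pair."""
    return round(step / epoch)


def dataset_size_range(steps: int, batch_size: int) -> range:
    """Training-set sizes n for which ceil(n / batch_size) == steps.

    Assumes one device, gradient_accumulation_steps == 1 and no dropped
    last batch (assumptions, not stated in the diff).
    """
    return range((steps - 1) * batch_size + 1, steps * batch_size + 1)


# Old run: step 50 at epoch 7.1429 with train_batch_size 8.
old_steps = steps_per_epoch(50, 7.1429)   # 7 steps per epoch
# New run: step 50 at epoch 3.8462 with train_batch_size 4.
new_steps = steps_per_epoch(50, 3.8462)   # 13 steps per epoch

old_sizes = dataset_size_range(old_steps, 8)   # 49..56 examples
new_sizes = dataset_size_range(new_steps, 4)   # 49..52 examples

# Sizes compatible with both runs.
common = sorted(set(old_sizes) & set(new_sizes))
print(common)  # [49, 50, 51, 52]
```

Under those assumptions both runs are consistent with a training set of roughly 50 examples, which may help explain why the precision, recall and F1 columns stay at zero even though the validation loss drops from 10.2871 to 0.9927.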

model.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:3ef12ba2ecdc04ec6c615d79d2b87e5c1541e173c81bdec8fd110d2197ffb847
 size 242041896

 version https://git-lfs.github.com/spec/v1
+oid sha256:8bb6849fbeb917bd800e61b24cc52bcf16c3a181db7f91a387b401d5f6540fed
 size 242041896
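
The `model.safetensors` entry is a Git LFS pointer, so only the `oid` changed while the `size` stayed at 242041896 bytes. A minimal sketch of how a downloaded weights file could be checked against the new pointer, using only the Python standard library, might look like this. The function names `parse_lfs_pointer` and `matches_pointer` are hypothetical helpers and not part of any Git LFS or Hugging Face tooling.

```python
import hashlib
from pathlib import Path


def parse_lfs_pointer(text: str) -> dict:
    """Parse the `key value` lines of a Git LFS pointer file."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.strip().partition(" ")
        fields[key] = value
    # The oid has the form "<hash-algorithm>:<hex digest>".
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(path, pointer: dict) -> bool:
    """Return True if the file at `path` has the pointer's size and digest."""
    p = Path(path)
    if p.stat().st_size != pointer["size"]:
        return False
    h = hashlib.new(pointer["algo"])
    with p.open("rb") as f:
        # Hash in 1 MiB chunks so large weight files are not loaded at once.
        for chunk in iter(lambda: f.read(1 << 20), b""):
            h.update(chunk)
    return h.hexdigest() == pointer["digest"]


# Pointer contents copied from the new side of this diff.
NEW_POINTER = """\
version https://git-lfs.github.com/spec/v1
oid sha256:8bb6849fbeb917bd800e61b24cc52bcf16c3a181db7f91a387b401d5f6540fed
size 242041896
"""
pointer = parse_lfs_pointer(NEW_POINTER)
```

Calling `matches_pointer("model.safetensors", pointer)` on a local copy (the path is an assumption) would then confirm whether the file corresponds to commit ce573a6 rather than its parent.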