Yirui091/lab1_random

text2text-generation

Generated from Trainer

Eval Results (legacy)


Yirui091 committed 21 days ago

Commit a3d58ce · verified · 1 Parent(s): 006190d

Yirui091/lab1_random

Files changed (2)

README.md +7 -7
generation_config.json +3 -4

README.md CHANGED

@@ -23,7 +23,7 @@ model-index:
     metrics:
     - name: Bleu
       type: bleu
-      value: 6.636251305602728
+      value: 6.9589599042040895
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -33,9 +33,9 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [Helsinki-NLP/opus-mt-en-fr](https://huggingface.co/Helsinki-NLP/opus-mt-en-fr) on the kde4 dataset.
 It achieves the following results on the evaluation set:
-- Loss: 5.2567
-- Model Preparation Time: 0.0113
-- Bleu: 6.6363
+- Loss: 5.2435
+- Model Preparation Time: 0.0032
+- Bleu: 6.9590
 ## Model description
@@ -58,7 +58,7 @@ The following hyperparameters were used during training:
 - train_batch_size: 16
 - eval_batch_size: 32
 - seed: 42
-- optimizer: Use OptimizerNames.ADAMW_TORCH_FUSED with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
+- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
 - training_steps: 5000
 - mixed_precision_training: Native AMP
@@ -69,7 +69,7 @@ The following hyperparameters were used during training:
 ### Framework versions
-- Transformers 4.57.0
+- Transformers 4.45.2
 - Pytorch 2.10.0+cu128
 - Datasets 2.18.0
-- Tokenizers 0.22.2
+- Tokenizers 0.20.3

generation_config.json CHANGED

@@ -4,13 +4,12 @@
       59513
     ]
   ],
+  "bos_token_id": 0,
   "decoder_start_token_id": 59513,
-  "eos_token_id": [
-    0
-  ],
+  "eos_token_id": 0,
   "forced_eos_token_id": 0,
   "max_length": 512,
   "num_beams": 4,
   "pad_token_id": 59513,
-  "transformers_version": "4.57.0"
+  "transformers_version": "4.45.2"
 }
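
Note on the `eos_token_id` change: one side of this diff stores the value as a one-element list (`[0]`) and the other as a bare integer (`0`). Code that reads `generation_config.json` directly may want to accept both shapes. The following is a minimal sketch using only the standard library; the helper name `normalize_eos_ids` is hypothetical and not part of Transformers, and the two JSON fragments are trimmed to the fields that matter here.

```python
import json


def normalize_eos_ids(config: dict) -> list[int]:
    """Return eos_token_id as a list, whether it is stored as an int, a list, or is absent.

    Hypothetical helper for illustration; not a Transformers API.
    """
    value = config.get("eos_token_id")
    if value is None:
        return []
    if isinstance(value, int):
        return [value]
    return [int(v) for v in value]


# Fragments mirroring the two sides of the generation_config.json diff
# (only the fields relevant to this comparison are kept).
list_form = json.loads('{"eos_token_id": [0], "pad_token_id": 59513}')
scalar_form = json.loads('{"bos_token_id": 0, "eos_token_id": 0, "pad_token_id": 59513}')

# Both shapes normalise to the same list of end-of-sequence ids.
same_eos = normalize_eos_ids(list_form) == normalize_eos_ids(scalar_form)
print(same_eos)
```

Under this normalisation the two versions describe the same end-of-sequence token, so the change to `eos_token_id` is one of serialization shape; the added `bos_token_id` and the `transformers_version` string are the fields whose values actually differ.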