jvalline
/

randomization_model

@@ -18,10 +18,10 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [t5-small](https://huggingface.co/t5-small) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 1.9445
-- Bleu: 0.0
 - Accuracy: 0.0
-- Gen Len: 18.9976
 ## Model description
@@ -41,23 +41,53 @@ More information needed
 The following hyperparameters were used during training:
 - learning_rate: 2e-05
-- train_batch_size: 8
-- eval_batch_size: 8
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
-- num_epochs: 1
 ### Training results
-| Training Loss | Epoch | Step | Validation Loss | Bleu | Accuracy | Gen Len |
-|:-------------:|:-----:|:----:|:---------------:|:----:|:--------:|:-------:|
-| 2.4068        | 1.0   | 6250 | 1.9445          | 0.0  | 0.0      | 18.9976 |
 ### Framework versions
-- Transformers 4.32.1
-- Pytorch 2.3.0.dev20240113
-- Datasets 2.12.0
-- Tokenizers 0.13.2

 This model is a fine-tuned version of [t5-small](https://huggingface.co/t5-small) on the None dataset.
 It achieves the following results on the evaluation set:
+- Loss: 2.2626
+- Bleu: 0.0001
 - Accuracy: 0.0
+- Gen Len: 18.999
 ## Model description
 The following hyperparameters were used during training:
 - learning_rate: 2e-05
+- train_batch_size: 10
+- eval_batch_size: 10
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
+- num_epochs: 3
+- mixed_precision_training: Native AMP
 ### Training results
+| Training Loss | Epoch | Step | Validation Loss | Bleu   | Accuracy | Gen Len |
+|:-------------:|:-----:|:----:|:---------------:|:------:|:--------:|:-------:|
+| 3.0103        | 0.1   | 50   | 2.5132          | 0.0    | 0.0      | 18.9985 |
+| 2.999         | 0.2   | 100  | 2.4883          | 0.0    | 0.0      | 19.0    |
+| 2.9457        | 0.3   | 150  | 2.4640          | 0.0    | 0.0      | 19.0    |
+| 2.8865        | 0.4   | 200  | 2.4431          | 0.0    | 0.0      | 19.0    |
+| 2.8935        | 0.5   | 250  | 2.4240          | 0.0    | 0.0      | 19.0    |
+| 2.8983        | 0.6   | 300  | 2.4079          | 0.0    | 0.0      | 19.0    |
+| 2.8579        | 0.7   | 350  | 2.3933          | 0.0    | 0.0      | 19.0    |
+| 2.8501        | 0.8   | 400  | 2.3794          | 0.0    | 0.0      | 19.0    |
+| 2.7892        | 0.9   | 450  | 2.3683          | 0.0    | 0.0      | 19.0    |
+| 2.7962        | 1.0   | 500  | 2.3561          | 0.0    | 0.0      | 19.0    |
+| 2.8408        | 1.1   | 550  | 2.3456          | 0.0    | 0.0      | 19.0    |
+| 2.8049        | 1.2   | 600  | 2.3350          | 0.0001 | 0.0      | 19.0    |
+| 2.8051        | 1.3   | 650  | 2.3278          | 0.0001 | 0.0      | 19.0    |
+| 2.8126        | 1.4   | 700  | 2.3192          | 0.0001 | 0.0      | 19.0    |
+| 2.7689        | 1.5   | 750  | 2.3121          | 0.0001 | 0.0      | 19.0    |
+| 2.7559        | 1.6   | 800  | 2.3051          | 0.0001 | 0.0      | 18.9995 |
+| 2.7672        | 1.7   | 850  | 2.2978          | 0.0001 | 0.0      | 18.9985 |
+| 2.7901        | 1.8   | 900  | 2.2916          | 0.0001 | 0.0      | 18.9995 |
+| 2.7571        | 1.9   | 950  | 2.2868          | 0.0001 | 0.0      | 18.9985 |
+| 2.7796        | 2.0   | 1000 | 2.2834          | 0.0001 | 0.0      | 18.9985 |
+| 2.7393        | 2.1   | 1050 | 2.2798          | 0.0001 | 0.0      | 18.9985 |
+| 2.7309        | 2.2   | 1100 | 2.2757          | 0.0001 | 0.0      | 18.9985 |
+| 2.7703        | 2.3   | 1150 | 2.2729          | 0.0001 | 0.0      | 18.999  |
+| 2.7354        | 2.4   | 1200 | 2.2703          | 0.0001 | 0.0      | 18.999  |
+| 2.7428        | 2.5   | 1250 | 2.2678          | 0.0001 | 0.0      | 18.999  |
+| 2.7571        | 2.6   | 1300 | 2.2661          | 0.0001 | 0.0      | 18.999  |
+| 2.7218        | 2.7   | 1350 | 2.2645          | 0.0001 | 0.0      | 18.999  |
+| 2.7051        | 2.8   | 1400 | 2.2634          | 0.0001 | 0.0      | 18.999  |
+| 2.7466        | 2.9   | 1450 | 2.2628          | 0.0001 | 0.0      | 18.999  |
+| 2.722         | 3.0   | 1500 | 2.2626          | 0.0001 | 0.0      | 18.999  |
 ### Framework versions
+- Transformers 4.37.1
+- Pytorch 2.1.0+cu121
+- Datasets 2.16.1
+- Tokenizers 0.15.1

generation_config.json CHANGED Viewed

@@ -2,5 +2,5 @@
   "decoder_start_token_id": 0,
   "eos_token_id": 1,
   "pad_token_id": 0,
-  "transformers_version": "4.32.1"
 }

   "decoder_start_token_id": 0,
   "eos_token_id": 1,
   "pad_token_id": 0,
+  "transformers_version": "4.37.1"
 }