jvalline commited on
Commit
fb3c964
·
verified ·
1 Parent(s): 36a9fe8

Model save

Browse files
Files changed (2) hide show
  1. README.md +16 -12
  2. generation_config.json +1 -1
README.md CHANGED
@@ -1,6 +1,6 @@
1
  ---
2
  license: apache-2.0
3
- base_model: t5-small
4
  tags:
5
  - generated_from_trainer
6
  metrics:
@@ -16,12 +16,12 @@ should probably proofread and complete it, then remove this comment. -->
16
 
17
  # randomization_model
18
 
19
- This model is a fine-tuned version of [t5-small](https://huggingface.co/t5-small) on the None dataset.
20
  It achieves the following results on the evaluation set:
21
- - Loss: 1.9445
22
  - Bleu: 0.0
23
  - Accuracy: 0.0
24
- - Gen Len: 18.9976
25
 
26
  ## Model description
27
 
@@ -46,18 +46,22 @@ The following hyperparameters were used during training:
46
  - seed: 42
47
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
48
  - lr_scheduler_type: linear
49
- - num_epochs: 1
 
50
 
51
  ### Training results
52
 
53
- | Training Loss | Epoch | Step | Validation Loss | Bleu | Accuracy | Gen Len |
54
- |:-------------:|:-----:|:----:|:---------------:|:----:|:--------:|:-------:|
55
- | 2.4068 | 1.0 | 6250 | 1.9445 | 0.0 | 0.0 | 18.9976 |
 
 
 
56
 
57
 
58
  ### Framework versions
59
 
60
- - Transformers 4.32.1
61
- - Pytorch 2.3.0.dev20240113
62
- - Datasets 2.12.0
63
- - Tokenizers 0.13.2
 
1
  ---
2
  license: apache-2.0
3
+ base_model: t5-base
4
  tags:
5
  - generated_from_trainer
6
  metrics:
 
16
 
17
  # randomization_model
18
 
19
+ This model is a fine-tuned version of [t5-base](https://huggingface.co/t5-base) on the None dataset.
20
  It achieves the following results on the evaluation set:
21
+ - Loss: nan
22
  - Bleu: 0.0
23
  - Accuracy: 0.0
24
+ - Gen Len: 0.0
25
 
26
  ## Model description
27
 
 
46
  - seed: 42
47
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
48
  - lr_scheduler_type: linear
49
+ - num_epochs: 4
50
+ - mixed_precision_training: Native AMP
51
 
52
  ### Training results
53
 
54
+ | Training Loss | Epoch | Step | Validation Loss | Bleu | Accuracy | Gen Len |
55
+ |:-------------:|:-----:|:-----:|:---------------:|:------:|:--------:|:-------:|
56
+ | 1.8655 | 1.0 | 6250 | 1.4874 | 0.0001 | 0.0 | 18.9988 |
57
+ | 1.8508 | 2.0 | 12500 | 1.4878 | 0.0001 | 0.0 | 18.9988 |
58
+ | 1.8781 | 3.0 | 18750 | nan | 0.0 | 0.0 | 0.0 |
59
+ | 0.0 | 4.0 | 25000 | nan | 0.0 | 0.0 | 0.0 |
60
 
61
 
62
  ### Framework versions
63
 
64
+ - Transformers 4.37.1
65
+ - Pytorch 2.3.0.dev20240122+cu121
66
+ - Datasets 2.16.1
67
+ - Tokenizers 0.15.1
generation_config.json CHANGED
@@ -2,5 +2,5 @@
2
  "decoder_start_token_id": 0,
3
  "eos_token_id": 1,
4
  "pad_token_id": 0,
5
- "transformers_version": "4.32.1"
6
  }
 
2
  "decoder_start_token_id": 0,
3
  "eos_token_id": 1,
4
  "pad_token_id": 0,
5
+ "transformers_version": "4.37.1"
6
  }