Ftmhd commited on
Commit
b7474d2
·
verified ·
1 Parent(s): e289251

End of training

Browse files
README.md CHANGED
@@ -4,6 +4,8 @@ license: apache-2.0
4
  base_model: t5-small
5
  tags:
6
  - generated_from_trainer
 
 
7
  model-index:
8
  - name: t5-small-finetuned-news
9
  results: []
@@ -15,6 +17,13 @@ should probably proofread and complete it, then remove this comment. -->
15
  # t5-small-finetuned-news
16
 
17
  This model is a fine-tuned version of [t5-small](https://huggingface.co/t5-small) on an unknown dataset.
 
 
 
 
 
 
 
18
 
19
  ## Model description
20
 
@@ -37,21 +46,22 @@ The following hyperparameters were used during training:
37
  - train_batch_size: 16
38
  - eval_batch_size: 16
39
  - seed: 42
40
- - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
41
  - lr_scheduler_type: linear
42
- - num_epochs: 1
43
  - mixed_precision_training: Native AMP
44
 
45
  ### Training results
46
 
47
  | Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum | Gen Len |
48
  |:-------------:|:-----:|:----:|:---------------:|:-------:|:-------:|:-------:|:---------:|:-------:|
49
- | No log | 1.0 | 332 | 1.3274 | 54.9613 | 35.0144 | 52.1856 | 52.1436 | 16.152 |
 
50
 
51
 
52
  ### Framework versions
53
 
54
- - Transformers 4.44.2
55
- - Pytorch 2.4.1+cu121
56
- - Datasets 3.0.1
57
- - Tokenizers 0.19.1
 
4
  base_model: t5-small
5
  tags:
6
  - generated_from_trainer
7
+ metrics:
8
+ - rouge
9
  model-index:
10
  - name: t5-small-finetuned-news
11
  results: []
 
17
  # t5-small-finetuned-news
18
 
19
  This model is a fine-tuned version of [t5-small](https://huggingface.co/t5-small) on an unknown dataset.
20
+ It achieves the following results on the evaluation set:
21
+ - Loss: 1.5749
22
+ - Rouge1: 43.7874
23
+ - Rouge2: 24.2639
24
+ - Rougel: 40.5888
25
+ - Rougelsum: 40.5008
26
+ - Gen Len: 18.6475
27
 
28
  ## Model description
29
 
 
46
  - train_batch_size: 16
47
  - eval_batch_size: 16
48
  - seed: 42
49
+ - optimizer: Use adamw_torch with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
50
  - lr_scheduler_type: linear
51
+ - num_epochs: 2
52
  - mixed_precision_training: Native AMP
53
 
54
  ### Training results
55
 
56
  | Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum | Gen Len |
57
  |:-------------:|:-----:|:----:|:---------------:|:-------:|:-------:|:-------:|:---------:|:-------:|
58
+ | No log | 1.0 | 175 | 1.5966 | 42.795 | 23.6707 | 39.6859 | 39.6641 | 18.6115 |
59
+ | No log | 2.0 | 350 | 1.5749 | 43.7874 | 24.2639 | 40.5888 | 40.5008 | 18.6475 |
60
 
61
 
62
  ### Framework versions
63
 
64
+ - Transformers 4.46.2
65
+ - Pytorch 2.5.1+cu121
66
+ - Datasets 3.1.0
67
+ - Tokenizers 0.20.3
generation_config.json CHANGED
@@ -2,5 +2,5 @@
2
  "decoder_start_token_id": 0,
3
  "eos_token_id": 1,
4
  "pad_token_id": 0,
5
- "transformers_version": "4.44.2"
6
  }
 
2
  "decoder_start_token_id": 0,
3
  "eos_token_id": 1,
4
  "pad_token_id": 0,
5
+ "transformers_version": "4.46.2"
6
  }
runs/Nov18_13-24-15_2318ff6358b5/events.out.tfevents.1731936273.2318ff6358b5.481.1 CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:2891c1aceea00d1e4e5ed45c2bb00c08bb841e20c509ec1f136148d5a05522e1
3
- size 12533
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:010ab9003c7df38aa42f0a10964c00c39d0bcdf60ec303c56485ed8857f1dacc
3
+ size 13412
tokenizer.json CHANGED
@@ -2,11 +2,18 @@
2
  "version": "1.0",
3
  "truncation": {
4
  "direction": "Right",
5
- "max_length": 128,
6
  "strategy": "LongestFirst",
7
  "stride": 0
8
  },
9
- "padding": null,
 
 
 
 
 
 
 
10
  "added_tokens": [
11
  {
12
  "id": 0,
 
2
  "version": "1.0",
3
  "truncation": {
4
  "direction": "Right",
5
+ "max_length": 512,
6
  "strategy": "LongestFirst",
7
  "stride": 0
8
  },
9
+ "padding": {
10
+ "strategy": "BatchLongest",
11
+ "direction": "Right",
12
+ "pad_to_multiple_of": null,
13
+ "pad_id": 0,
14
+ "pad_type_id": 0,
15
+ "pad_token": "<pad>"
16
+ },
17
  "added_tokens": [
18
  {
19
  "id": 0,