dotslashderek commited on
Commit
426eab7
·
verified ·
1 Parent(s): 64dfc1b

End of training

Browse files
Files changed (2) hide show
  1. README.md +23 -16
  2. model.safetensors +1 -1
README.md CHANGED
@@ -1,31 +1,31 @@
1
  ---
2
  library_name: transformers
3
  license: apache-2.0
4
- base_model: google/mt5-small
5
  tags:
6
  - generated_from_trainer
7
  metrics:
8
  - rouge
9
  model-index:
10
- - name: light-compression-multilingual
11
  results: []
12
  ---
13
 
14
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
15
  should probably proofread and complete it, then remove this comment. -->
16
 
17
- # light-compression-multilingual
18
 
19
- This model is a fine-tuned version of [google/mt5-small](https://huggingface.co/google/mt5-small) on the None dataset.
20
  It achieves the following results on the evaluation set:
21
- - Loss: 7.4457
22
- - Rouge1: 0.0150
23
- - Rouge2: 0.0002
24
- - Rougel: 0.0149
25
- - Rougelsum: 0.0149
26
- - Comp Ratio Mean: 0.1324
27
- - Comp Ratio P90: 0.2632
28
- - Pct Violations: 0.0032
29
 
30
  ## Model description
31
 
@@ -52,15 +52,22 @@ The following hyperparameters were used during training:
52
  No additional optimizer arguments
53
  - lr_scheduler_type: linear
54
  - lr_scheduler_warmup_ratio: 0.1
55
- - num_epochs: 4
56
 
57
  ### Training results
58
 
59
  | Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum | Comp Ratio Mean | Comp Ratio P90 | Pct Violations |
60
  |:-------------:|:-----:|:-----:|:---------------:|:------:|:------:|:------:|:---------:|:---------------:|:--------------:|:--------------:|
61
- | 18.5739 | 1.0 | 8074 | 8.8190 | 0.0220 | 0.0006 | 0.0217 | 0.0217 | 0.2520 | 0.4667 | 0.0170 |
62
- | 11.7884 | 2.0 | 16148 | 7.7442 | 0.0163 | 0.0002 | 0.0162 | 0.0162 | 0.1679 | 0.3333 | 0.0068 |
63
- | 10.14 | 3.0 | 24222 | 7.4457 | 0.0150 | 0.0002 | 0.0149 | 0.0149 | 0.1324 | 0.2632 | 0.0032 |
 
 
 
 
 
 
 
64
 
65
 
66
  ### Framework versions
 
1
  ---
2
  library_name: transformers
3
  license: apache-2.0
4
+ base_model: google/flan-t5-small
5
  tags:
6
  - generated_from_trainer
7
  metrics:
8
  - rouge
9
  model-index:
10
+ - name: flan-t5-small-prompt-compression
11
  results: []
12
  ---
13
 
14
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
15
  should probably proofread and complete it, then remove this comment. -->
16
 
17
+ # flan-t5-small-prompt-compression
18
 
19
+ This model is a fine-tuned version of [google/flan-t5-small](https://huggingface.co/google/flan-t5-small) on the None dataset.
20
  It achieves the following results on the evaluation set:
21
+ - Loss: 0.5181
22
+ - Rouge1: 0.8820
23
+ - Rouge2: 0.7104
24
+ - Rougel: 0.8485
25
+ - Rougelsum: 0.8488
26
+ - Comp Ratio Mean: 0.6611
27
+ - Comp Ratio P90: 0.7674
28
+ - Pct Violations: 0.0
29
 
30
  ## Model description
31
 
 
52
  No additional optimizer arguments
53
  - lr_scheduler_type: linear
54
  - lr_scheduler_warmup_ratio: 0.1
55
+ - num_epochs: 10
56
 
57
  ### Training results
58
 
59
  | Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum | Comp Ratio Mean | Comp Ratio P90 | Pct Violations |
60
  |:-------------:|:-----:|:-----:|:---------------:|:------:|:------:|:------:|:---------:|:---------------:|:--------------:|:--------------:|
61
+ | 1.2576 | 1.0 | 1594 | 0.6457 | 0.8528 | 0.6587 | 0.8197 | 0.8199 | 0.6626 | 0.7736 | 0.0 |
62
+ | 0.7688 | 2.0 | 3188 | 0.5727 | 0.8689 | 0.6851 | 0.8345 | 0.8349 | 0.6647 | 0.7694 | 0.0 |
63
+ | 0.6591 | 3.0 | 4782 | 0.5405 | 0.8750 | 0.6963 | 0.8413 | 0.8417 | 0.6684 | 0.7692 | 0.0 |
64
+ | 0.5957 | 4.0 | 6376 | 0.5333 | 0.8771 | 0.7002 | 0.8438 | 0.8440 | 0.6600 | 0.7660 | 0.0 |
65
+ | 0.548 | 5.0 | 7970 | 0.5212 | 0.8792 | 0.7059 | 0.8467 | 0.8470 | 0.6617 | 0.7648 | 0.0004 |
66
+ | 0.5139 | 6.0 | 9564 | 0.5196 | 0.8799 | 0.7064 | 0.8472 | 0.8473 | 0.6597 | 0.7636 | 0.0 |
67
+ | 0.4862 | 7.0 | 11158 | 0.5144 | 0.8805 | 0.7076 | 0.8473 | 0.8474 | 0.6656 | 0.7705 | 0.0004 |
68
+ | 0.466 | 8.0 | 12752 | 0.5157 | 0.8819 | 0.7098 | 0.8489 | 0.8492 | 0.6622 | 0.7674 | 0.0 |
69
+ | 0.4499 | 9.0 | 14346 | 0.5156 | 0.8816 | 0.7096 | 0.8486 | 0.8489 | 0.6604 | 0.7660 | 0.0 |
70
+ | 0.4393 | 10.0 | 15940 | 0.5181 | 0.8820 | 0.7104 | 0.8485 | 0.8488 | 0.6611 | 0.7674 | 0.0 |
71
 
72
 
73
  ### Framework versions
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:8ed0e3967fe915c600dcd41e3a617bae251601ef6d520b6fe5cf77f7b70b0e43
3
  size 307867048
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:1d0990e7ac2bc48f592f2a0bda7875a1811acd8478f18c42a520af533a962ef3
3
  size 307867048