nightner commited on
Commit
4955edb
·
verified ·
1 Parent(s): 5af1d07

nightner/roberta2roberta_financial_lora_v1_small

Browse files
README.md CHANGED
@@ -16,16 +16,16 @@ should probably proofread and complete it, then remove this comment. -->
16
 
17
  # results
18
 
19
- This model is a fine-tuned version of [google/roberta2roberta_L-24_cnn_daily_mail](https://huggingface.co/google/roberta2roberta_L-24_cnn_daily_mail) on the None dataset.
20
  It achieves the following results on the evaluation set:
21
- - Loss: 2.3415
22
- - Rouge1: 39.24
23
- - Rouge2: 20.62
24
- - Rougel: 29.42
25
- - Rougelsum: 29.35
26
- - Bertscore P: 84.82
27
- - Bertscore R: 85.16
28
- - Bertscore F1: 84.9
29
 
30
  ## Model description
31
 
@@ -45,22 +45,23 @@ More information needed
45
 
46
  The following hyperparameters were used during training:
47
  - learning_rate: 3e-05
48
- - train_batch_size: 1
49
- - eval_batch_size: 1
50
  - seed: 42
51
  - gradient_accumulation_steps: 4
52
- - total_train_batch_size: 4
53
  - optimizer: Use OptimizerNames.ADAMW_TORCH with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
54
  - lr_scheduler_type: cosine
55
  - lr_scheduler_warmup_ratio: 0.1
56
- - num_epochs: 2
57
 
58
  ### Training results
59
 
60
- | Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum | Bertscore P | Bertscore R | Bertscore F1 |
61
- |:-------------:|:------:|:----:|:---------------:|:------:|:------:|:------:|:---------:|:-----------:|:-----------:|:------------:|
62
- | 13.8236 | 0.7251 | 300 | 2.8115 | 36.62 | 19.66 | 27.32 | 27.25 | 83.21 | 84.38 | 83.62 |
63
- | 10.8258 | 1.4495 | 600 | 2.3415 | 39.24 | 20.62 | 29.42 | 29.35 | 84.82 | 85.16 | 84.9 |
 
64
 
65
 
66
  ### Framework versions
 
16
 
17
  # results
18
 
19
+ This model is a fine-tuned version of [google/roberta2roberta_L-24_cnn_daily_mail](https://huggingface.co/google/roberta2roberta_L-24_cnn_daily_mail) on an unknown dataset.
20
  It achieves the following results on the evaluation set:
21
+ - Loss: 6.7395
22
+ - Rouge1: 33.11
23
+ - Rouge2: 20.39
24
+ - Rougel: 27.32
25
+ - Rougelsum: 27.42
26
+ - Bertscore P: 87.57
27
+ - Bertscore R: 83.35
28
+ - Bertscore F1: 85.27
29
 
30
  ## Model description
31
 
 
45
 
46
  The following hyperparameters were used during training:
47
  - learning_rate: 3e-05
48
+ - train_batch_size: 2
49
+ - eval_batch_size: 2
50
  - seed: 42
51
  - gradient_accumulation_steps: 4
52
+ - total_train_batch_size: 8
53
  - optimizer: Use OptimizerNames.ADAMW_TORCH with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
54
  - lr_scheduler_type: cosine
55
  - lr_scheduler_warmup_ratio: 0.1
56
+ - num_epochs: 3
57
 
58
  ### Training results
59
 
60
+ | Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum | Bertscore P | Bertscore R | Bertscore F1 |
61
+ |:-------------:|:-----:|:----:|:---------------:|:------:|:------:|:------:|:---------:|:-----------:|:-----------:|:------------:|
62
+ | 36.5558 | 0.8 | 20 | 7.8969 | 32.35 | 20.37 | 27.74 | 27.82 | 87.74 | 83.51 | 85.43 |
63
+ | 32.2661 | 1.6 | 40 | 7.0747 | 34.94 | 21.86 | 29.83 | 30.04 | 87.7 | 83.93 | 85.62 |
64
+ | 30.3129 | 2.4 | 60 | 6.7395 | 33.11 | 20.39 | 27.32 | 27.42 | 87.57 | 83.35 | 85.27 |
65
 
66
 
67
  ### Framework versions
adapter_model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:76885af14102f0fcc29d0c8dbb0b27b2e1d39760a59551a69fd98a1063d65526
3
  size 12611432
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:0b9de919c6a1c38c9cd8fd0d1b2c9b59b7a64ab050fffda96b5a59b49d82d803
3
  size 12611432
runs/Feb17_12-24-42_0bfd00035eae/events.out.tfevents.1739795132.0bfd00035eae.2888.1 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:7f1a8e71f4544298c4c105d3c9bfc75397e0a344f352d607201a7f9025ab89a7
3
+ size 9384
runs/Feb17_12-30-54_0bfd00035eae/events.out.tfevents.1739795476.0bfd00035eae.2888.2 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:5658e7395cbb99f57ebf5d20ace7031d79bffa2b9129db5c306723c24e1d4926
3
+ size 13272
training_args.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:3e16beb67675cca3530aabca05d390cc84b64ffac7516c9386be2263bca9606c
3
  size 5432
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:13b7c44cc55c6c221920be5f2abeaac3412ad878f406b2e5e2f5d78b4a89c1a9
3
  size 5432