roiyeho commited on
Commit
a5c213f
·
verified ·
1 Parent(s): 8d8bf09

Training on SAMSum complete!

Browse files
README.md CHANGED
@@ -2,7 +2,10 @@
2
  license: mit
3
  base_model: facebook/bart-large-cnn
4
  tags:
 
5
  - generated_from_trainer
 
 
6
  model-index:
7
  - name: bart-large-samsum
8
  results: []
@@ -15,7 +18,11 @@ should probably proofread and complete it, then remove this comment. -->
15
 
16
  This model is a fine-tuned version of [facebook/bart-large-cnn](https://huggingface.co/facebook/bart-large-cnn) on an unknown dataset.
17
  It achieves the following results on the evaluation set:
18
- - Loss: 1.3576
 
 
 
 
19
 
20
  ## Model description
21
 
@@ -46,17 +53,14 @@ The following hyperparameters were used during training:
46
 
47
  ### Training results
48
 
49
- | Training Loss | Epoch | Step | Validation Loss |
50
- |:-------------:|:-----:|:----:|:---------------:|
51
- | No log | 0.22 | 200 | 1.5108 |
52
- | No log | 0.43 | 400 | 1.4143 |
53
- | 1.3989 | 0.65 | 600 | 1.4067 |
54
- | 1.3989 | 0.87 | 800 | 1.3576 |
55
 
56
 
57
  ### Framework versions
58
 
59
- - Transformers 4.35.2
60
  - Pytorch 2.1.0+cu121
61
- - Datasets 2.17.0
62
  - Tokenizers 0.15.2
 
2
  license: mit
3
  base_model: facebook/bart-large-cnn
4
  tags:
5
+ - summarization
6
  - generated_from_trainer
7
+ metrics:
8
+ - rouge
9
  model-index:
10
  - name: bart-large-samsum
11
  results: []
 
18
 
19
  This model is a fine-tuned version of [facebook/bart-large-cnn](https://huggingface.co/facebook/bart-large-cnn) on an unknown dataset.
20
  It achieves the following results on the evaluation set:
21
+ - Loss: 1.3770
22
+ - Rouge1: 0.3912
23
+ - Rouge2: 0.1962
24
+ - Rougel: 0.2988
25
+ - Rougelsum: 0.2989
26
 
27
  ## Model description
28
 
 
53
 
54
  ### Training results
55
 
56
+ | Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum |
57
+ |:-------------:|:-----:|:----:|:---------------:|:------:|:------:|:------:|:---------:|
58
+ | 1.3353 | 0.54 | 500 | 1.4306 | 0.3925 | 0.1959 | 0.3017 | 0.3012 |
 
 
 
59
 
60
 
61
  ### Framework versions
62
 
63
+ - Transformers 4.38.2
64
  - Pytorch 2.1.0+cu121
65
+ - Datasets 2.18.0
66
  - Tokenizers 0.15.2
config.json CHANGED
@@ -64,7 +64,7 @@
64
  }
65
  },
66
  "torch_dtype": "float32",
67
- "transformers_version": "4.35.2",
68
  "use_cache": true,
69
  "vocab_size": 50264
70
  }
 
64
  }
65
  },
66
  "torch_dtype": "float32",
67
+ "transformers_version": "4.38.2",
68
  "use_cache": true,
69
  "vocab_size": 50264
70
  }
generation_config.json CHANGED
@@ -1,5 +1,4 @@
1
  {
2
- "_from_model_config": true,
3
  "bos_token_id": 0,
4
  "decoder_start_token_id": 2,
5
  "early_stopping": true,
@@ -12,5 +11,5 @@
12
  "no_repeat_ngram_size": 3,
13
  "num_beams": 4,
14
  "pad_token_id": 1,
15
- "transformers_version": "4.35.2"
16
  }
 
1
  {
 
2
  "bos_token_id": 0,
3
  "decoder_start_token_id": 2,
4
  "early_stopping": true,
 
11
  "no_repeat_ngram_size": 3,
12
  "num_beams": 4,
13
  "pad_token_id": 1,
14
+ "transformers_version": "4.38.2"
15
  }
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:f90884b9f26fbacd10a16efa74ba9c99a5e1c100dcecc254dc5d132aa247ff8c
3
  size 1625422896
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:dcbf5bbf77eff9ce3295c883d4023d244c28a0f36a91467e63d4c8bd3e5d2169
3
  size 1625422896
runs/Mar10_13-59-42_5b0bd46df8c8/events.out.tfevents.1710080014.5b0bd46df8c8.160.0 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:2cffa519316b11d5084283274d15ca9ae40d759d0beb7b12fd13a876848215d8
3
+ size 6097
runs/Mar10_14-15-05_5b0bd46df8c8/events.out.tfevents.1710080109.5b0bd46df8c8.160.1 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:1fa5a521692d37688067ca3dd68e73496740821649ed436c0c5afdc91769d64b
3
+ size 6371
runs/Mar10_14-34-28_5b0bd46df8c8/events.out.tfevents.1710081275.5b0bd46df8c8.160.2 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:04e0d47ad4211afd8a35159eef30f20abed546c50aed468aff6db27c8e94c0f2
3
+ size 8422
runs/Mar10_14-34-28_5b0bd46df8c8/events.out.tfevents.1710085463.5b0bd46df8c8.160.3 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:cc61b98be62c86473effb913ecc7ea3424dcd889e91b675a57a162a294cb6021
3
+ size 562
tokenizer.json CHANGED
@@ -2,7 +2,7 @@
2
  "version": "1.0",
3
  "truncation": {
4
  "direction": "Right",
5
- "max_length": 1024,
6
  "strategy": "LongestFirst",
7
  "stride": 0
8
  },
 
2
  "version": "1.0",
3
  "truncation": {
4
  "direction": "Right",
5
+ "max_length": 128,
6
  "strategy": "LongestFirst",
7
  "stride": 0
8
  },
training_args.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:cf4e43fc370f522ca0c4def80371444d19824f149183f529799ef896bdea7b5b
3
- size 4536
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:7dcc13853312b158aabbb2c79d0dff71aa98ca9e0fef4e82a30d7e03a3e3c1ad
3
+ size 5048