ManyaGupta commited on
Commit
d7b3d2c
·
verified ·
1 Parent(s): 9445346

Model save

Browse files
README.md CHANGED
@@ -1,15 +1,15 @@
1
- ---
2
- base_model: bigscience/bloomz-560m
3
- library_name: peft
4
- license: bigscience-bloom-rail-1.0
5
- tags:
6
- - trl
7
- - sft
8
- - generated_from_trainer
9
- model-index:
10
- - name: bloom_ts2
11
- results: []
12
- ---
13
 
14
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
15
  should probably proofread and complete it, then remove this comment. -->
@@ -18,7 +18,7 @@ should probably proofread and complete it, then remove this comment. -->
18
 
19
  This model is a fine-tuned version of [bigscience/bloomz-560m](https://huggingface.co/bigscience/bloomz-560m) on an unknown dataset.
20
  It achieves the following results on the evaluation set:
21
- - Loss: 4.0056
22
 
23
  ## Model description
24
 
@@ -51,17 +51,17 @@ The following hyperparameters were used during training:
51
 
52
  | Training Loss | Epoch | Step | Validation Loss |
53
  |:-------------:|:-----:|:----:|:---------------:|
54
- | 5.105 | 1.0 | 25 | 4.3496 |
55
- | 4.7363 | 2.0 | 50 | 4.1841 |
56
- | 4.528 | 3.0 | 75 | 4.0786 |
57
- | 4.4779 | 4.0 | 100 | 4.0237 |
58
- | 4.5107 | 5.0 | 125 | 4.0056 |
59
 
60
 
61
  ### Framework versions
62
 
63
  - PEFT 0.12.0
64
  - Transformers 4.44.2
65
- - Pytorch 2.4.0+cu121
66
  - Datasets 3.0.0
67
  - Tokenizers 0.19.1
 
1
+ ---
2
+ base_model: bigscience/bloomz-560m
3
+ library_name: peft
4
+ license: bigscience-bloom-rail-1.0
5
+ tags:
6
+ - trl
7
+ - sft
8
+ - generated_from_trainer
9
+ model-index:
10
+ - name: bloom_ts2
11
+ results: []
12
+ ---
13
 
14
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
15
  should probably proofread and complete it, then remove this comment. -->
 
18
 
19
  This model is a fine-tuned version of [bigscience/bloomz-560m](https://huggingface.co/bigscience/bloomz-560m) on an unknown dataset.
20
  It achieves the following results on the evaluation set:
21
+ - Loss: 4.7031
22
 
23
  ## Model description
24
 
 
51
 
52
  | Training Loss | Epoch | Step | Validation Loss |
53
  |:-------------:|:-----:|:----:|:---------------:|
54
+ | 4.9365 | 1.0 | 25 | 4.6211 |
55
+ | 4.7031 | 2.0 | 50 | 4.6562 |
56
+ | 4.7573 | 3.0 | 75 | 4.6953 |
57
+ | 4.3525 | 4.0 | 100 | 4.7031 |
58
+ | 4.8447 | 5.0 | 125 | 4.7031 |
59
 
60
 
61
  ### Framework versions
62
 
63
  - PEFT 0.12.0
64
  - Transformers 4.44.2
65
+ - Pytorch 2.2.1
66
  - Datasets 3.0.0
67
  - Tokenizers 0.19.1
runs/Sep15_20-39-30_LAPTOP-1QK69S9G/events.out.tfevents.1726412986.LAPTOP-1QK69S9G.4776.0 CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:ecab320f1fdd5b0bf96c4a66e3a4904ff379f2cea1b36e7afcb083198668c188
3
- size 32248
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:efc2bf25763516a28808b24d7ae5744d0a216d5ae30beff77d9d881c3f12356d
3
+ size 32862