adil0101 commited on
Commit
71c0dfb
·
verified ·
1 Parent(s): bef88d2

adil0101/falcon_instruct_generation

Browse files
README.md CHANGED
@@ -20,7 +20,7 @@ should probably proofread and complete it, then remove this comment. -->
20
 
21
  This model is a fine-tuned version of [mistralai/Mistral-7B-v0.1](https://huggingface.co/mistralai/Mistral-7B-v0.1) on the generator dataset.
22
  It achieves the following results on the evaluation set:
23
- - Loss: 0.5623
24
 
25
  ## Model description
26
 
@@ -53,17 +53,17 @@ The following hyperparameters were used during training:
53
 
54
  | Training Loss | Epoch | Step | Validation Loss |
55
  |:-------------:|:-----:|:----:|:---------------:|
56
- | 0.6959 | 1.0 | 20 | 0.6080 |
57
- | 0.6022 | 2.0 | 40 | 0.5696 |
58
- | 0.5497 | 3.0 | 60 | 0.5566 |
59
- | 0.4953 | 4.0 | 80 | 0.5542 |
60
- | 0.4469 | 5.0 | 100 | 0.5623 |
61
 
62
 
63
  ### Framework versions
64
 
65
  - PEFT 0.10.0
66
- - Transformers 4.39.3
67
  - Pytorch 2.2.2+cu121
68
  - Datasets 2.18.0
69
- - Tokenizers 0.15.2
 
20
 
21
  This model is a fine-tuned version of [mistralai/Mistral-7B-v0.1](https://huggingface.co/mistralai/Mistral-7B-v0.1) on the generator dataset.
22
  It achieves the following results on the evaluation set:
23
+ - Loss: 0.5379
24
 
25
  ## Model description
26
 
 
53
 
54
  | Training Loss | Epoch | Step | Validation Loss |
55
  |:-------------:|:-----:|:----:|:---------------:|
56
+ | 0.7039 | 1.0 | 20 | 0.5936 |
57
+ | 0.5821 | 2.0 | 40 | 0.5534 |
58
+ | 0.5477 | 3.0 | 60 | 0.5422 |
59
+ | 0.4996 | 4.0 | 80 | 0.5402 |
60
+ | 0.4485 | 5.0 | 100 | 0.5379 |
61
 
62
 
63
  ### Framework versions
64
 
65
  - PEFT 0.10.0
66
+ - Transformers 4.40.0
67
  - Pytorch 2.2.2+cu121
68
  - Datasets 2.18.0
69
+ - Tokenizers 0.19.1
adapter_model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:f35ea1f109783c5796c1cc37533ba82a7117d6ef640ae0b5602dc6f483e5fb7d
3
  size 109069176
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:cb4678f4cf156e9c32fe3e380e02e93105c10979d2231e8c98ceebe4524f28fd
3
  size 109069176
runs/Apr18_22-45-14_ebeca3ee6be0/events.out.tfevents.1713480316.ebeca3ee6be0.4873.0 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:d0d774f279e09433848d4279f70887767bb03a73769e37788e986795cdf6f0e8
3
+ size 8932
tokenizer.json CHANGED
@@ -134,6 +134,7 @@
134
  "end_of_word_suffix": null,
135
  "fuse_unk": true,
136
  "byte_fallback": true,
 
137
  "vocab": {
138
  "<unk>": 0,
139
  "<s>": 1,
 
134
  "end_of_word_suffix": null,
135
  "fuse_unk": true,
136
  "byte_fallback": true,
137
+ "ignore_merges": false,
138
  "vocab": {
139
  "<unk>": 0,
140
  "<s>": 1,
training_args.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:d371f1bf33f7d6dc02637fd323722f184e27363eef225770e2dfd2dc86fda916
3
- size 4920
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:5a3857492eb571d37c27a3e8780ae5baa138d7434de7f221d135538764c2ce22
3
+ size 5048