lvcalucioli committed on
Commit b0fba33 · verified · 1 Parent(s): a6e1c91

ca-finetuned-phi-2

README.md CHANGED
@@ -18,7 +18,7 @@ should probably proofread and complete it, then remove this comment. -->
 
  This model is a fine-tuned version of [microsoft/phi-2](https://huggingface.co/microsoft/phi-2) on the None dataset.
  It achieves the following results on the evaluation set:
- - Loss: 1.9920
+ - Loss: 1.7293
  - Perplexity: 0.0000
 
  ## Model description
@@ -53,14 +53,16 @@ The following hyperparameters were used during training:
 
  | Training Loss | Epoch | Step | Validation Loss | Perplexity |
  |:-------------:|:-----:|:----:|:---------------:|:----------:|
- | No log | 1.0 | 1 | 2.3882 | 0.0000 |
- | No log | 2.0 | 2 | 2.2933 | 0.0000 |
- | No log | 3.0 | 4 | 2.1606 | 0.0000 |
- | No log | 4.0 | 5 | 2.0964 | 0.0000 |
- | No log | 5.0 | 6 | 2.0452 | 0.0000 |
- | No log | 6.0 | 8 | 2.0022 | 0.0000 |
- | No log | 7.0 | 9 | 1.9943 | 0.0000 |
- | No log | 8.0 | 10 | 1.9920 | 0.0000 |
+ | No log | 0.97 | 7 | 2.4415 | 0.0000 |
+ | No log | 1.95 | 14 | 2.1788 | 0.0000 |
+ | 2.5859 | 2.92 | 21 | 2.0108 | 0.0000 |
+ | 2.5859 | 3.9 | 28 | 1.9063 | 0.0000 |
+ | 2.0806 | 4.87 | 35 | 1.8347 | 0.0000 |
+ | 2.0806 | 5.98 | 43 | 1.7810 | 0.0000 |
+ | 1.84 | 6.96 | 50 | 1.7497 | 0.0000 |
+ | 1.84 | 7.93 | 57 | 1.7358 | 0.0000 |
+ | 1.7413 | 8.9 | 64 | 1.7306 | 0.0000 |
+ | 1.7413 | 9.74 | 70 | 1.7293 | 0.0000 |
 
 
  ### Framework versions
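Review note on the Perplexity column: every row reports 0.0000, in both the old and the new table. Under the common definition, perplexity is the exponential of the mean cross-entropy loss, so it cannot fall below 1 for a non-negative loss. The sketch below recomputes it from the final validation losses in this diff. It assumes the reported loss is a mean negative log-likelihood in nats; the diff does not confirm how the training script computes either number, and `perplexity_from_loss` is a hypothetical helper name.

```python
import math


def perplexity_from_loss(mean_nll: float) -> float:
    """Perplexity as exp(mean negative log-likelihood), loss assumed to be in nats."""
    return math.exp(mean_nll)


# Final validation losses copied from the old and new tables in the README diff.
old_final_loss = 1.9920
new_final_loss = 1.7293

for label, loss in [("old", old_final_loss), ("new", new_final_loss)]:
    ppl = perplexity_from_loss(loss)
    print(f"{label}: loss={loss:.4f} perplexity={ppl:.2f}")
```

If the assumption holds, a lower final loss implies a lower perplexity for the new run, and neither value would round to 0.0000.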
adapter_config.json CHANGED
@@ -19,8 +19,8 @@
  "rank_pattern": {},
  "revision": null,
  "target_modules": [
- "fc1",
  "fc2",
+ "fc1",
  "Wqkv"
  ],
  "task_type": "CAUSAL_LM",
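Review note on `target_modules`: the hunk moves `"fc1"` below `"fc2"` and changes nothing else. A minimal check, assuming the adapter library matches module names without regard to list order (an assumption, not something this diff confirms), shows the two lists name the same modules. `same_modules` is a hypothetical helper.

```python
# Lists copied from the old and new sides of the adapter_config.json hunk.
old_target_modules = ["fc1", "fc2", "Wqkv"]
new_target_modules = ["fc2", "fc1", "Wqkv"]


def same_modules(a: list[str], b: list[str]) -> bool:
    """True when both lists name the same modules, ignoring order."""
    return set(a) == set(b)


print("order changed:", old_target_modules != new_target_modules)
print("same module set:", same_modules(old_target_modules, new_target_modules))
```

Under that assumption the edit is a serialization-order change and should not alter which layers receive adapters.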
adapter_model.safetensors CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:1432ae232ff114e9236553c8e1b7fb40e331a47b287af43df046e4c7a42bbcbb
+ oid sha256:7da1c87977f29bb80d79cab7c2ff73c7efa3d4fa997857e93588de8f87d16732
  size 146825352
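Review note on the pointer file: `adapter_model.safetensors` is stored through Git LFS, so the diff shows only the pointer (spec version, `oid`, `size`), not the weights. The sketch below parses the two pointers from this hunk to make the change explicit: the content hash differs while the byte size is identical. `parse_lfs_pointer` is a hypothetical helper, and it handles only the three keys that appear here.

```python
def parse_lfs_pointer(text: str) -> dict:
    """Parse the 'key value' lines of a Git LFS pointer into a small dict."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.strip().partition(" ")
        fields[key] = value
    hash_algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "hash_algo": hash_algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


# Pointer contents copied from the old and new sides of the hunk.
old_pointer = parse_lfs_pointer("""
version https://git-lfs.github.com/spec/v1
oid sha256:1432ae232ff114e9236553c8e1b7fb40e331a47b287af43df046e4c7a42bbcbb
size 146825352
""")
new_pointer = parse_lfs_pointer("""
version https://git-lfs.github.com/spec/v1
oid sha256:7da1c87977f29bb80d79cab7c2ff73c7efa3d4fa997857e93588de8f87d16732
size 146825352
""")

print("content changed:", old_pointer["digest"] != new_pointer["digest"])
print("size unchanged:", old_pointer["size"] == new_pointer["size"])
```

An unchanged size with a new hash is what one would expect when the adapter keeps the same tensor shapes and only the values are retrained, though the pointer alone cannot prove that.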
tokenizer.json CHANGED
@@ -2,7 +2,7 @@
  "version": "1.0",
  "truncation": {
  "direction": "Right",
- "max_length": 512,
+ "max_length": 364,
  "strategy": "LongestFirst",
  "stride": 0
  },
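Review note on the truncation change: `max_length` drops from 512 to 364 while `direction` stays `"Right"` and `stride` stays 0. For a single sequence, that combination amounts to keeping the first `max_length` tokens and discarding the tail. The sketch below is a simplified model of that behavior, not the tokenizer library's implementation; it ignores special tokens and sequence pairs, and `truncate_right` is a hypothetical helper.

```python
def truncate_right(token_ids: list[int], max_length: int) -> list[int]:
    """Keep the first max_length tokens and drop the rest (right-side truncation)."""
    return token_ids[:max_length]


# A made-up 400-token input: it fit under the old limit but is cut under the new one.
example_ids = list(range(400))

kept_old = truncate_right(example_ids, 512)
kept_new = truncate_right(example_ids, 364)

print("tokens kept with max_length=512:", len(kept_old))
print("tokens kept with max_length=364:", len(kept_new))
```

Inputs between 365 and 512 tokens are the ones affected: they were passed through whole before this commit and now lose their tail.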
training_args.bin CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:56de8dff970b4a55858bf24335ebf386cad56ec256f65a50c85e55523233977c
+ oid sha256:3ff98dac56699d34eb15c09aa1525894f3514760317ece52cb2ce3e2477dc9a0
  size 4347