mtzig committed on
Commit a9b8f0f · verified · 1 Parent(s): 7366985

Model save

README.md CHANGED
@@ -16,7 +16,7 @@ should probably proofread and complete it, then remove this comment. -->
 
 This model is a fine-tuned version of [TinyLlama/TinyLlama-1.1B-Chat-v1.0](https://huggingface.co/TinyLlama/TinyLlama-1.1B-Chat-v1.0) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: nan
+- Loss: 0.5165
 
 ## Model description
 
@@ -50,8 +50,8 @@ The following hyperparameters were used during training:
 
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
-| No log | 0.4 | 2 | nan |
-| No log | 0.8 | 4 | nan |
+| No log | 0.4 | 2 | 0.6340 |
+| No log | 0.8 | 4 | 0.5165 |
 
 
 ### Framework versions
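The README.md hunks change the reported validation loss from nan to finite values (0.6340 at step 2, 0.5165 at step 4). As a minimal sketch, assuming the model card table is available as Markdown text, a check like the one below could flag non-finite losses before a card is pushed. `find_nonfinite_losses` is a hypothetical helper written for illustration; it is not part of any library and is not known to be what the author used.

```python
import math


def find_nonfinite_losses(table_md: str) -> list[tuple[int, str]]:
    """Return (step, raw_value) for rows whose validation loss is NaN or infinite."""
    bad = []
    for line in table_md.splitlines():
        cells = [c.strip() for c in line.strip().strip("|").split("|")]
        # Expected columns: Training Loss | Epoch | Step | Validation Loss
        if len(cells) != 4 or not cells[2].isdigit():
            continue  # header, separator, or malformed row
        try:
            loss = float(cells[3])
        except ValueError:
            continue  # not a number at all; out of scope for this check
        if not math.isfinite(loss):
            bad.append((int(cells[2]), cells[3]))
    return bad


# Table rows copied from the two sides of the diff.
OLD_TABLE = """| Training Loss | Epoch | Step | Validation Loss |
|:-------------:|:-----:|:----:|:---------------:|
| No log | 0.4 | 2 | nan |
| No log | 0.8 | 4 | nan |"""

NEW_TABLE = """| Training Loss | Epoch | Step | Validation Loss |
|:-------------:|:-----:|:----:|:---------------:|
| No log | 0.4 | 2 | 0.6340 |
| No log | 0.8 | 4 | 0.5165 |"""
```

Run against the old table, the helper reports both evaluation steps; against the new table it reports none.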
final/adapter_config.json CHANGED
@@ -24,8 +24,8 @@
 "rank_pattern": {},
 "revision": null,
 "target_modules": [
-"q_proj",
 "k_proj",
+"q_proj",
 "v_proj",
 "o_proj"
 ],
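The only change in final/adapter_config.json is the position of "q_proj" within "target_modules". The sketch below shows one way to confirm that two such lists differ in order only. It assumes the adapter loader treats the list as an unordered collection, which this diff does not itself confirm; `same_modules` is a hypothetical helper.

```python
import json

# The two versions of "target_modules" as they appear in the diff.
OLD_MODULES = json.loads('["q_proj", "k_proj", "v_proj", "o_proj"]')
NEW_MODULES = json.loads('["k_proj", "q_proj", "v_proj", "o_proj"]')


def same_modules(a: list[str], b: list[str]) -> bool:
    """True when both lists name the same modules, ignoring order and duplicates."""
    return set(a) == set(b)


if __name__ == "__main__":
    print("identical as lists:", OLD_MODULES == NEW_MODULES)
    print("identical as sets: ", same_modules(OLD_MODULES, NEW_MODULES))
```

If the set comparison holds while the list comparison does not, the hunk is a serialization-order change and not a change in which projections are adapted.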
final/adapter_model.safetensors CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:18fa74c76360bf6f2f85ddad45632b9d6d43663653f830389241672b4616f51a
+oid sha256:c9c74dc405a2b8717ff50933f769b0dc5c03cb8b5f4b5b6d5b25e948f5a06aba
 size 9034304
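final/adapter_model.safetensors is stored as a Git LFS pointer, so the diff shows only a new SHA-256 object ID with an unchanged size of 9034304 bytes. The following is a minimal sketch of checking downloaded bytes against such a pointer. `parse_lfs_pointer` and `matches_pointer` are hypothetical helpers using only the standard library, and they assume the three-field pointer layout shown in the diff.

```python
import hashlib


def parse_lfs_pointer(text: str) -> dict:
    """Parse 'version', 'oid', and 'size' lines of a Git LFS pointer file."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.partition(" ")
        fields[key] = value
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(data: bytes, pointer: dict) -> bool:
    """True when the byte length and SHA-256 digest of data match the pointer."""
    if pointer["algo"] != "sha256":
        raise ValueError(f"unsupported hash algorithm: {pointer['algo']}")
    return (
        len(data) == pointer["size"]
        and hashlib.sha256(data).hexdigest() == pointer["digest"]
    )


# Pointer text taken from the new side of the diff.
NEW_POINTER = parse_lfs_pointer(
    "version https://git-lfs.github.com/spec/v1\n"
    "oid sha256:c9c74dc405a2b8717ff50933f769b0dc5c03cb8b5f4b5b6d5b25e948f5a06aba\n"
    "size 9034304\n"
)
```

Because the size is identical on both sides, only the digest distinguishes the old adapter weights from the new ones.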
final/tokenizer.json CHANGED
@@ -1,19 +1,7 @@
 {
 "version": "1.0",
-"truncation": {
-"direction": "Right",
-"max_length": 1024,
-"strategy": "LongestFirst",
-"stride": 0
-},
-"padding": {
-"strategy": "BatchLongest",
-"direction": "Right",
-"pad_to_multiple_of": null,
-"pad_id": 2,
-"pad_type_id": 0,
-"pad_token": "</s>"
-},
+"truncation": null,
+"padding": null,
 "added_tokens": [
 {
 "id": 0,
final/training_args.bin CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:67555874961dd2c97f3dddfe093aa697fa6bb50ffb490e3f81442c7944cd00ea
+oid sha256:b8d6ab70993f77f33eb2e4092599016b68a809c5f24fb7121c4556fe62b5f34f
 size 5713
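In this commit, the final/tokenizer.json hunk replaces the serialized "truncation" settings (max_length 1024) and "padding" settings (pad token "</s>", pad_id 2) with null. A minimal sketch of the same normalization on the JSON text follows. `clear_truncation_and_padding` is a hypothetical helper using only the standard library; how the author actually produced the change is not known from the diff.

```python
import json


def clear_truncation_and_padding(tokenizer_json: str) -> str:
    """Return tokenizer JSON with 'truncation' and 'padding' set to null.

    All other keys, and their order, are left as they were.
    """
    config = json.loads(tokenizer_json)
    config["truncation"] = None
    config["padding"] = None
    return json.dumps(config, indent=2, ensure_ascii=False)


# Abbreviated stand-in for the old file; "added_tokens" is shortened for illustration.
OLD_JSON = json.dumps({
    "version": "1.0",
    "truncation": {"direction": "Right", "max_length": 1024,
                   "strategy": "LongestFirst", "stride": 0},
    "padding": {"strategy": "BatchLongest", "direction": "Right",
                "pad_to_multiple_of": None, "pad_id": 2,
                "pad_type_id": 0, "pad_token": "</s>"},
    "added_tokens": [{"id": 0}],
})

NEW_JSON = clear_truncation_and_padding(OLD_JSON)
```

With both fields null, the saved file no longer carries a truncation length or padding strategy, so any code loading it would need to pass those settings explicitly if it relied on them.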