Incomple commited on
Commit
109ccbd
·
verified ·
1 Parent(s): df3c4d0

Model save

Browse files
Files changed (1) hide show
  1. README.md +9 -9
README.md CHANGED
@@ -7,18 +7,18 @@ tags:
7
  - lora
8
  - generated_from_trainer
9
  model-index:
10
- - name: Sailor2-8B-Chat_sft_sg_values
11
  results: []
12
  ---
13
 
14
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
15
  should probably proofread and complete it, then remove this comment. -->
16
 
17
- # Sailor2-8B-Chat_sft_sg_values
18
 
19
- This model is a fine-tuned version of [sail/Sailor2-8B-Chat](https://huggingface.co/sail/Sailor2-8B-Chat) on the sft_sg_values dataset.
20
  It achieves the following results on the evaluation set:
21
- - Loss: 0.1354
22
 
23
  ## Model description
24
 
@@ -52,11 +52,11 @@ The following hyperparameters were used during training:
52
 
53
  | Training Loss | Epoch | Step | Validation Loss |
54
  |:-------------:|:------:|:----:|:---------------:|
55
- | 0.4422 | 0.1869 | 250 | 0.2675 |
56
- | 0.1727 | 0.3738 | 500 | 0.1680 |
57
- | 0.1475 | 0.5607 | 750 | 0.1457 |
58
- | 0.1284 | 0.7477 | 1000 | 0.1373 |
59
- | 0.1066 | 0.9346 | 1250 | 0.1348 |
60
 
61
 
62
  ### Framework versions
 
7
  - lora
8
  - generated_from_trainer
9
  model-index:
10
+ - name: Sailor2-8B-Chat_sft_sg_values_resp_split
11
  results: []
12
  ---
13
 
14
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
15
  should probably proofread and complete it, then remove this comment. -->
16
 
17
+ # Sailor2-8B-Chat_sft_sg_values_resp_split
18
 
19
+ This model is a fine-tuned version of [sail/Sailor2-8B-Chat](https://huggingface.co/sail/Sailor2-8B-Chat) on an unknown dataset.
20
  It achieves the following results on the evaluation set:
21
+ - Loss: 0.1359
22
 
23
  ## Model description
24
 
 
52
 
53
  | Training Loss | Epoch | Step | Validation Loss |
54
  |:-------------:|:------:|:----:|:---------------:|
55
+ | 0.3715 | 0.1710 | 250 | 0.2821 |
56
+ | 0.1642 | 0.3419 | 500 | 0.1734 |
57
+ | 0.1431 | 0.5129 | 750 | 0.1512 |
58
+ | 0.1257 | 0.6839 | 1000 | 0.1415 |
59
+ | 0.1281 | 0.8548 | 1250 | 0.1359 |
60
 
61
 
62
  ### Framework versions