Incomple commited on
Commit
8b07b8b
·
verified ·
1 Parent(s): 4cf1eca

Model save

Browse files
Files changed (1) hide show
  1. README.md +9 -9
README.md CHANGED
@@ -7,18 +7,18 @@ tags:
7
  - lora
8
  - generated_from_trainer
9
  model-index:
10
- - name: Phi-4-mini-instruct_sft_sg_values
11
  results: []
12
  ---
13
 
14
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
15
  should probably proofread and complete it, then remove this comment. -->
16
 
17
- # Phi-4-mini-instruct_sft_sg_values
18
 
19
- This model is a fine-tuned version of [microsoft/Phi-4-mini-instruct](https://huggingface.co/microsoft/Phi-4-mini-instruct) on the sft_sg_values dataset.
20
  It achieves the following results on the evaluation set:
21
- - Loss: 2.3848
22
 
23
  ## Model description
24
 
@@ -52,11 +52,11 @@ The following hyperparameters were used during training:
52
 
53
  | Training Loss | Epoch | Step | Validation Loss |
54
  |:-------------:|:------:|:----:|:---------------:|
55
- | 4.3614 | 0.1869 | 250 | 4.1655 |
56
- | 3.3914 | 0.3738 | 500 | 3.1697 |
57
- | 2.684 | 0.5607 | 750 | 2.6262 |
58
- | 2.5099 | 0.7477 | 1000 | 2.4442 |
59
- | 2.3706 | 0.9346 | 1250 | 2.3880 |
60
 
61
 
62
  ### Framework versions
 
7
  - lora
8
  - generated_from_trainer
9
  model-index:
10
+ - name: Phi-4-mini-instruct_sft_sg_values_resp_split
11
  results: []
12
  ---
13
 
14
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
15
  should probably proofread and complete it, then remove this comment. -->
16
 
17
+ # Phi-4-mini-instruct_sft_sg_values_resp_split
18
 
19
+ This model is a fine-tuned version of [microsoft/Phi-4-mini-instruct](https://huggingface.co/microsoft/Phi-4-mini-instruct) on an unknown dataset.
20
  It achieves the following results on the evaluation set:
21
+ - Loss: 2.3345
22
 
23
  ## Model description
24
 
 
52
 
53
  | Training Loss | Epoch | Step | Validation Loss |
54
  |:-------------:|:------:|:----:|:---------------:|
55
+ | 4.3698 | 0.1710 | 250 | 4.1731 |
56
+ | 3.5096 | 0.3419 | 500 | 3.1497 |
57
+ | 2.6213 | 0.5129 | 750 | 2.5736 |
58
+ | 2.4305 | 0.6839 | 1000 | 2.3980 |
59
+ | 2.3653 | 0.8548 | 1250 | 2.3345 |
60
 
61
 
62
  ### Framework versions