ryan-northell commited on
Commit
3b8b393
·
verified ·
1 Parent(s): d680d27

Northell/phi-3.5-adapter

Browse files
Files changed (1) hide show
  1. README.md +3 -10
README.md CHANGED
@@ -18,7 +18,7 @@ should probably proofread and complete it, then remove this comment. -->
18
 
19
  This model is a fine-tuned version of [microsoft/Phi-3.5-mini-instruct](https://huggingface.co/microsoft/Phi-3.5-mini-instruct) on an unknown dataset.
20
  It achieves the following results on the evaluation set:
21
- - Loss: 0.0000
22
 
23
  ## Model description
24
 
@@ -45,20 +45,13 @@ The following hyperparameters were used during training:
45
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
46
  - lr_scheduler_type: cosine
47
  - lr_scheduler_warmup_ratio: 0.2
48
- - training_steps: 800
49
 
50
  ### Training results
51
 
52
  | Training Loss | Epoch | Step | Validation Loss |
53
  |:-------------:|:------:|:----:|:---------------:|
54
- | 0.0429 | 0.2222 | 100 | 0.0164 |
55
- | 0.0002 | 0.4444 | 200 | 0.0001 |
56
- | 0.0 | 0.6667 | 300 | 0.0000 |
57
- | 0.0 | 0.8889 | 400 | 0.0000 |
58
- | 0.0 | 1.1111 | 500 | 0.0000 |
59
- | 0.0 | 1.3333 | 600 | 0.0000 |
60
- | 0.0 | 1.5556 | 700 | 0.0000 |
61
- | 0.0 | 1.7778 | 800 | 0.0000 |
62
 
63
 
64
  ### Framework versions
 
18
 
19
  This model is a fine-tuned version of [microsoft/Phi-3.5-mini-instruct](https://huggingface.co/microsoft/Phi-3.5-mini-instruct) on an unknown dataset.
20
  It achieves the following results on the evaluation set:
21
+ - Loss: 0.0011
22
 
23
  ## Model description
24
 
 
45
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
46
  - lr_scheduler_type: cosine
47
  - lr_scheduler_warmup_ratio: 0.2
48
+ - training_steps: 100
49
 
50
  ### Training results
51
 
52
  | Training Loss | Epoch | Step | Validation Loss |
53
  |:-------------:|:------:|:----:|:---------------:|
54
+ | 0.0007 | 0.2222 | 100 | 0.0011 |
 
 
 
 
 
 
 
55
 
56
 
57
  ### Framework versions