EricWesthoff committed on
Commit c310aa2 · 1 Parent(s): 0af28c3

End of training

Files changed (1)
  1. README.md +20 -3
README.md CHANGED
@@ -14,6 +14,8 @@ should probably proofread and complete it, then remove this comment. -->
14
  # phi-1_5-finetuned-SQL-2
15
 
16
  This model is a fine-tuned version of [microsoft/phi-1_5](https://huggingface.co/microsoft/phi-1_5) on the None dataset.
 
 
17
 
18
  ## Model description
19
 
@@ -33,15 +35,30 @@ More information needed
33
 
34
  The following hyperparameters were used during training:
35
  - learning_rate: 0.0002
36
- - train_batch_size: 4
37
  - eval_batch_size: 8
38
  - seed: 42
 
 
39
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
40
- - lr_scheduler_type: cosine
41
- - num_epochs: 6
 
42
 
43
  ### Training results
44
 
45
 
46
 
47
  ### Framework versions
 
14
  # phi-1_5-finetuned-SQL-2
15
 
16
  This model is a fine-tuned version of [microsoft/phi-1_5](https://huggingface.co/microsoft/phi-1_5) on the None dataset.
17
+ It achieves the following results on the evaluation set:
18
+ - Loss: 0.7586
19
 
20
  ## Model description
21
 
 
35
 
36
  The following hyperparameters were used during training:
37
  - learning_rate: 0.0002
38
+ - train_batch_size: 8
39
  - eval_batch_size: 8
40
  - seed: 42
41
+ - gradient_accumulation_steps: 4
42
+ - total_train_batch_size: 32
43
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
44
+ - lr_scheduler_type: linear
45
+ - lr_scheduler_warmup_ratio: 0.1
46
+ - training_steps: 10000
47
 
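The batch-size and scheduler entries above are linked by simple arithmetic. The following is a minimal sketch of that arithmetic, not the training script behind this commit. It assumes a single device (the card does not state the device count), and the helper names `effective_batch_size` and `linear_lr` are hypothetical. The schedule uses the common linear-warmup, linear-decay shape and assumes the warmup length is `ceil(training_steps * warmup_ratio)`.

```python
import math


def effective_batch_size(per_device: int, grad_accum: int, num_devices: int = 1) -> int:
    """Examples consumed per optimizer step (num_devices=1 is an assumption)."""
    return per_device * grad_accum * num_devices


def linear_lr(step: int,
              base_lr: float = 2e-4,
              total_steps: int = 10_000,
              warmup_ratio: float = 0.1) -> float:
    """Learning rate at `step` for linear warmup followed by linear decay to zero."""
    # Assumed rounding rule: warmup length is rounded up to a whole step.
    warmup_steps = math.ceil(total_steps * warmup_ratio)
    if step < warmup_steps:
        # Warmup phase: ramp from 0 up to base_lr.
        return base_lr * step / max(1, warmup_steps)
    # Decay phase: ramp from base_lr down to 0 at total_steps.
    remaining = max(0, total_steps - step)
    return base_lr * remaining / max(1, total_steps - warmup_steps)


if __name__ == "__main__":
    # train_batch_size=8 and gradient_accumulation_steps=4 from the card
    print("total_train_batch_size:", effective_batch_size(8, 4))
    for s in (0, 500, 1000, 5500, 10_000):
        print(s, linear_lr(s))
```

With the values listed in the card, the per-device batch of 8 accumulated over 4 steps gives the reported total of 32, and the learning rate peaks at 0.0002 once the first 10% of the 10000 steps have elapsed.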
48
  ### Training results
49
 
50
+ | Training Loss | Epoch | Step | Validation Loss |
51
+ |:-------------:|:-----:|:-----:|:---------------:|
52
+ | 2.1598 | 3.2 | 1000 | 1.7484 |
53
+ | 1.6682 | 6.4 | 2000 | 1.5604 |
54
+ | 1.5154 | 9.6 | 3000 | 1.4110 |
55
+ | 1.3766 | 12.8 | 4000 | 1.2735 |
56
+ | 1.2458 | 16.0 | 5000 | 1.1389 |
57
+ | 1.1176 | 19.2 | 6000 | 1.0163 |
58
+ | 1.0093 | 22.4 | 7000 | 0.9195 |
59
+ | 0.9166 | 25.6 | 8000 | 0.8406 |
60
+ | 0.8448 | 28.8 | 9000 | 0.7828 |
61
+ | 0.7928 | 32.0 | 10000 | 0.7586 |
62
 
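The Epoch and Step columns in the table above can be cross-checked against the reported total batch size of 32. The sketch below is illustrative only: the function names are hypothetical, the dataset size is inferred rather than stated in the card, and the perplexity figure assumes the reported loss is a mean per-token cross-entropy in nats.

```python
import math

TOTAL_TRAIN_BATCH_SIZE = 32  # from the hyperparameter list in the card


def steps_per_epoch(step: int, epoch: float) -> float:
    """Optimizer steps per epoch implied by one (Epoch, Step) table row."""
    return step / epoch


def approx_examples_per_epoch(step: int, epoch: float,
                              batch_size: int = TOTAL_TRAIN_BATCH_SIZE) -> float:
    """Rough training-set size implied by a table row (an inference, not a stated fact)."""
    return steps_per_epoch(step, epoch) * batch_size


def perplexity(loss: float) -> float:
    """exp(loss); meaningful only if loss is mean token cross-entropy in nats (assumed)."""
    return math.exp(loss)


if __name__ == "__main__":
    # First and last rows of the results table
    print(steps_per_epoch(1000, 3.2), steps_per_epoch(10_000, 32.0))
    print(approx_examples_per_epoch(10_000, 32.0))
    print(perplexity(0.7586))
```

Both the first and last rows imply about 312.5 optimizer steps per epoch, which at 32 examples per step corresponds to roughly 10,000 training examples per epoch. Under the stated assumption, the final validation loss of 0.7586 corresponds to a perplexity of about 2.1.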
63
 
64
  ### Framework versions