JayShah07 commited on
Commit
dddce0a
·
verified ·
1 Parent(s): 754b73f

Training completed!

Browse files
Files changed (1) hide show
  1. README.md +7 -28
README.md CHANGED
@@ -1,9 +1,9 @@
1
  ---
2
- license: mit
3
  library_name: peft
 
4
  tags:
5
  - generated_from_trainer
6
- base_model: microsoft/phi-2
7
  model-index:
8
  - name: final-checkpoint
9
  results: []
@@ -15,8 +15,6 @@ should probably proofread and complete it, then remove this comment. -->
15
  # final-checkpoint
16
 
17
  This model is a fine-tuned version of [microsoft/phi-2](https://huggingface.co/microsoft/phi-2) on an unknown dataset.
18
- It achieves the following results on the evaluation set:
19
- - Loss: 1.3274
20
 
21
  ## Model description
22
 
@@ -43,35 +41,16 @@ The following hyperparameters were used during training:
43
  - total_train_batch_size: 4
44
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
45
  - lr_scheduler_type: linear
46
- - lr_scheduler_warmup_steps: 1
47
  - training_steps: 400
48
 
49
  ### Training results
50
 
51
- | Training Loss | Epoch | Step | Validation Loss |
52
- |:-------------:|:-----:|:----:|:---------------:|
53
- | 1.6563 | 0.05 | 25 | 1.3900 |
54
- | 1.1883 | 0.1 | 50 | 1.3818 |
55
- | 1.4433 | 0.15 | 75 | 1.3524 |
56
- | 1.203 | 0.2 | 100 | 1.3627 |
57
- | 1.4397 | 0.25 | 125 | 1.3451 |
58
- | 1.1394 | 0.3 | 150 | 1.3585 |
59
- | 1.4034 | 0.35 | 175 | 1.3412 |
60
- | 1.1449 | 0.4 | 200 | 1.3406 |
61
- | 1.4447 | 0.45 | 225 | 1.3342 |
62
- | 1.2276 | 0.5 | 250 | 1.3345 |
63
- | 1.4556 | 0.55 | 275 | 1.3313 |
64
- | 1.1622 | 0.6 | 300 | 1.3302 |
65
- | 1.4267 | 0.65 | 325 | 1.3286 |
66
- | 1.2025 | 0.7 | 350 | 1.3285 |
67
- | 1.3953 | 0.75 | 375 | 1.3275 |
68
- | 1.188 | 0.8 | 400 | 1.3274 |
69
 
70
 
71
  ### Framework versions
72
 
73
- - PEFT 0.10.0
74
- - Transformers 4.39.0
75
- - Pytorch 2.2.1+cu121
76
- - Datasets 2.18.0
77
- - Tokenizers 0.15.2
 
1
  ---
2
+ base_model: microsoft/phi-2
3
  library_name: peft
4
+ license: mit
5
  tags:
6
  - generated_from_trainer
 
7
  model-index:
8
  - name: final-checkpoint
9
  results: []
 
15
  # final-checkpoint
16
 
17
  This model is a fine-tuned version of [microsoft/phi-2](https://huggingface.co/microsoft/phi-2) on an unknown dataset.
 
 
18
 
19
  ## Model description
20
 
 
41
  - total_train_batch_size: 4
42
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
43
  - lr_scheduler_type: linear
 
44
  - training_steps: 400
45
 
46
  ### Training results
47
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
48
 
49
 
50
  ### Framework versions
51
 
52
+ - PEFT 0.12.0
53
+ - Transformers 4.43.3
54
+ - Pytorch 2.3.1+cu121
55
+ - Datasets 2.20.0
56
+ - Tokenizers 0.19.1