vijaykdass commited on
Commit
e14040b
·
verified ·
1 Parent(s): 22cefa3

End of training

Browse files
README.md CHANGED
@@ -16,7 +16,12 @@ should probably proofread and complete it, then remove this comment. -->
16
 
17
  This model is a fine-tuned version of [bigcode/starcoderbase-1b](https://huggingface.co/bigcode/starcoderbase-1b) on an unknown dataset.
18
  It achieves the following results on the evaluation set:
19
- - Loss: 0.9485
 
 
 
 
 
20
 
21
  ## Model description
22
 
@@ -42,14 +47,7 @@ The following hyperparameters were used during training:
42
  - optimizer: Use OptimizerNames.ADAMW_TORCH with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
43
  - lr_scheduler_type: cosine
44
  - lr_scheduler_warmup_steps: 30
45
- - training_steps: 100
46
-
47
- ### Training results
48
-
49
- | Training Loss | Epoch | Step | Validation Loss |
50
- |:-------------:|:-----:|:----:|:---------------:|
51
- | 0.9371 | 1.0 | 100 | 0.9485 |
52
-
53
 
54
  ### Framework versions
55
 
 
16
 
17
  This model is a fine-tuned version of [bigcode/starcoderbase-1b](https://huggingface.co/bigcode/starcoderbase-1b) on an unknown dataset.
18
  It achieves the following results on the evaluation set:
19
+ - eval_loss: 0.9764
20
+ - eval_runtime: 460.8363
21
+ - eval_samples_per_second: 19.031
22
+ - eval_steps_per_second: 1.191
23
+ - epoch: 0.4
24
+ - step: 400
25
 
26
  ## Model description
27
 
 
47
  - optimizer: Use OptimizerNames.ADAMW_TORCH with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
48
  - lr_scheduler_type: cosine
49
  - lr_scheduler_warmup_steps: 30
50
+ - training_steps: 1000
 
 
 
 
 
 
 
51
 
52
  ### Framework versions
53
 
adapter_model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:3cae4d5c4a13f90cae8915a04b9deeb39c37aec2a30e3de20b778e0102c43046
3
  size 22241240
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:904c477581eb85fd561e957dbbbccad3d866b6eaa0ebc43a9bdb7b77a43c945f
3
  size 22241240
runs/Jun24_12-26-53_d2df0b6a1228/events.out.tfevents.1750768025.d2df0b6a1228.356.0 CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:fc6a9c9d4e71f93420b83b9701cabca9b98c417e9291609347faee01451486c0
3
- size 10224
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:761293c5ed1c8070cec81c4d0b53bd5f811d0547592a0a6b73e8bb821922c1f1
3
+ size 10435