jlpan/starcoder-tune-cpp2py-snippet1

Generated from Trainer

jlpan committed on Aug 19, 2023

Commit ba9a718 · 1 parent: ad6fd4f

update model card README.md

Files changed (1)

README.md +12 -23

@@ -6,7 +6,6 @@ tags:
 model-index:
 - name: starcoder-tune-cpp2py-snippet1
   results: []
-library_name: peft
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -16,7 +15,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [bigcode/starcoder](https://huggingface.co/bigcode/starcoder) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.3914
 ## Model description
@@ -35,7 +34,7 @@ More information needed
 ### Training hyperparameters
 The following hyperparameters were used during training:
-- learning_rate: 0.0003
 - train_batch_size: 1
 - eval_batch_size: 1
 - seed: 42
@@ -50,30 +49,20 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
-| 0.7375        | 0.1   | 50   | 0.3869          |
-| 0.4809        | 0.2   | 100  | 0.3847          |
-| 0.4753        | 0.3   | 150  | 0.3898          |
-| 0.4472        | 0.4   | 200  | 0.3882          |
-| 0.4509        | 0.5   | 250  | 0.3938          |
-| 0.4369        | 0.6   | 300  | 0.3897          |
-| 0.432         | 0.7   | 350  | 0.3968          |
-| 0.4324        | 0.8   | 400  | 0.3917          |
-| 0.4219        | 0.9   | 450  | 0.3920          |
-| 0.4314        | 1.0   | 500  | 0.3914          |
 ### Framework versions
-- PEFT 0.5.0.dev0
-- PEFT 0.5.0.dev0
-- PEFT 0.5.0.dev0
-- PEFT 0.5.0.dev0
-- PEFT 0.5.0.dev0
-- PEFT 0.5.0.dev0
-- PEFT 0.5.0.dev0
-- PEFT 0.5.0.dev0
-- PEFT 0.5.0.dev0
-- PEFT 0.5.0.dev0
 - Transformers 4.32.0.dev0
 - Pytorch 2.0.1+cu117
 - Datasets 2.12.0

 model-index:
 - name: starcoder-tune-cpp2py-snippet1
   results: []
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 This model is a fine-tuned version of [bigcode/starcoder](https://huggingface.co/bigcode/starcoder) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.3488
 ## Model description
 ### Training hyperparameters
 The following hyperparameters were used during training:
+- learning_rate: 8e-05
 - train_batch_size: 1
 - eval_batch_size: 1
 - seed: 42
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
+| 0.8208        | 0.1   | 50   | 0.3781          |
+| 0.4662        | 0.2   | 100  | 0.3514          |
+| 0.4402        | 0.3   | 150  | 0.3529          |
+| 0.4406        | 0.4   | 200  | 0.3492          |
+| 0.432         | 0.5   | 250  | 0.3505          |
+| 0.4205        | 0.6   | 300  | 0.3539          |
+| 0.42          | 0.7   | 350  | 0.3504          |
+| 0.4277        | 0.8   | 400  | 0.3480          |
+| 0.4238        | 0.9   | 450  | 0.3491          |
+| 0.4143        | 1.0   | 500  | 0.3488          |
 ### Framework versions
 - Transformers 4.32.0.dev0
 - Pytorch 2.0.1+cu117
 - Datasets 2.12.0
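
The two results tables in this diff can be compared directly. The sketch below is not part of the commit; it copies the validation losses from the removed rows (learning_rate 0.0003) and the added rows (learning_rate 8e-05) and reports the best and final value for each run. The names `STEPS`, `OLD_VAL`, `NEW_VAL`, and `summarize` are made up for illustration. The diff shows only `learning_rate` changing among the visible hyperparameters, but other settings are hidden in the collapsed context, so the comparison describes the two runs and does not attribute the difference to a single cause.

```python
# Validation loss at each evaluation step, copied from the tables in the diff.
STEPS = list(range(50, 501, 50))  # 50, 100, ..., 500

# Removed rows (previous card, learning_rate 0.0003).
OLD_VAL = [0.3869, 0.3847, 0.3898, 0.3882, 0.3938,
           0.3897, 0.3968, 0.3917, 0.3920, 0.3914]

# Added rows (updated card, learning_rate 8e-05).
NEW_VAL = [0.3781, 0.3514, 0.3529, 0.3492, 0.3505,
           0.3539, 0.3504, 0.3480, 0.3491, 0.3488]


def summarize(steps, losses):
    """Return (best_step, best_loss, final_loss) for one run."""
    best_idx = min(range(len(losses)), key=losses.__getitem__)
    return steps[best_idx], losses[best_idx], losses[-1]


if __name__ == "__main__":
    for label, losses in (("old", OLD_VAL), ("new", NEW_VAL)):
        best_step, best_loss, final_loss = summarize(STEPS, losses)
        print(f"{label}: best {best_loss:.4f} at step {best_step}, "
              f"final {final_loss:.4f}")
```

The final values match the `Loss:` lines in each version of the card (0.3914 before, 0.3488 after).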