jlpan/starcoder-finetuned-test_newSnippet

Generated from Trainer

jlpan committed on Aug 22, 2023

Commit d50d161 (1 parent: 4ef90a1)

update model card README.md

Files changed (1)

README.md +15 -35

@@ -6,7 +6,6 @@ tags:
 model-index:
 - name: starcoder-finetuned-test_newSnippet
   results: []
-library_name: peft
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -16,7 +15,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [bigcode/starcoder](https://huggingface.co/bigcode/starcoder) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.1728
 ## Model description
@@ -35,7 +34,7 @@ More information needed
 ### Training hyperparameters
 The following hyperparameters were used during training:
-- learning_rate: 8e-05
 - train_batch_size: 32
 - eval_batch_size: 32
 - seed: 42
@@ -43,47 +42,28 @@ The following hyperparameters were used during training:
 - total_train_batch_size: 256
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: cosine
-- lr_scheduler_warmup_steps: 80
-- training_steps: 800
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
-| 4.5787        | 0.06  | 50   | 0.3535          |
-| 0.236         | 0.12  | 100  | 0.1948          |
-| 0.1862        | 1.01  | 150  | 0.1847          |
-| 0.1866        | 1.07  | 200  | 0.1808          |
-| 0.1838        | 1.13  | 250  | 0.1794          |
-| 0.1718        | 2.02  | 300  | 0.1772          |
-| 0.1796        | 2.08  | 350  | 0.1761          |
-| 0.178         | 2.14  | 400  | 0.1762          |
-| 0.1666        | 3.03  | 450  | 0.1743          |
-| 0.1772        | 3.09  | 500  | 0.1739          |
-| 0.1739        | 3.15  | 550  | 0.1746          |
-| 0.1652        | 4.04  | 600  | 0.1731          |
-| 0.1755        | 4.1   | 650  | 0.1731          |
-| 0.1706        | 4.16  | 700  | 0.1735          |
-| 0.1668        | 5.04  | 750  | 0.1728          |
-| 0.1747        | 5.11  | 800  | 0.1728          |
 ### Framework versions
-- PEFT 0.5.0.dev0
-- PEFT 0.5.0.dev0
-- PEFT 0.5.0.dev0
-- PEFT 0.5.0.dev0
-- PEFT 0.5.0.dev0
-- PEFT 0.5.0.dev0
-- PEFT 0.5.0.dev0
-- PEFT 0.5.0.dev0
-- PEFT 0.5.0.dev0
-- PEFT 0.5.0.dev0
-- PEFT 0.5.0.dev0
-- PEFT 0.5.0.dev0
-- PEFT 0.5.0.dev0
-- PEFT 0.5.0.dev0
 - Transformers 4.32.0.dev0
 - Pytorch 2.0.1+cu117
 - Datasets 2.12.0

 model-index:
 - name: starcoder-finetuned-test_newSnippet
   results: []
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 This model is a fine-tuned version of [bigcode/starcoder](https://huggingface.co/bigcode/starcoder) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.1863
 ## Model description
 ### Training hyperparameters
 The following hyperparameters were used during training:
+- learning_rate: 5e-05
 - train_batch_size: 32
 - eval_batch_size: 32
 - seed: 42
 - total_train_batch_size: 256
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: cosine
+- lr_scheduler_warmup_steps: 25
+- training_steps: 275
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
+| 6.5337        | 0.09  | 25   | 0.7416          |
+| 0.4715        | 0.18  | 50   | 0.2515          |
+| 0.2329        | 0.27  | 75   | 0.2060          |
+| 0.2093        | 0.36  | 100  | 0.1973          |
+| 0.1994        | 0.45  | 125  | 0.1935          |
+| 0.1836        | 1.03  | 150  | 0.1893          |
+| 0.1912        | 1.12  | 175  | 0.1877          |
+| 0.1947        | 1.21  | 200  | 0.1870          |
+| 0.194         | 1.3   | 225  | 0.1865          |
+| 0.1908        | 1.39  | 250  | 0.1863          |
+| 0.1845        | 1.48  | 275  | 0.1863          |
 ### Framework versions
 - Transformers 4.32.0.dev0
 - Pytorch 2.0.1+cu117
 - Datasets 2.12.0
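
The updated hyperparameters (`learning_rate: 5e-05`, `lr_scheduler_type: cosine`, `lr_scheduler_warmup_steps: 25`, `training_steps: 275`) imply a learning-rate curve that can be sketched as follows. This is a minimal pure-Python sketch, assuming linear warmup followed by a half-cosine decay to zero, which is how a cosine schedule with warmup is commonly defined. The helper name `cosine_lr` is hypothetical, and the model card does not show the scheduler code that was actually run.

```python
import math

# Values taken from the updated model card.
BASE_LR = 5e-05
WARMUP_STEPS = 25
TRAINING_STEPS = 275


def cosine_lr(step, base_lr=BASE_LR, warmup_steps=WARMUP_STEPS,
              total_steps=TRAINING_STEPS):
    """Hypothetical helper: learning rate at a given optimizer step.

    Assumes linear warmup from 0 to base_lr, then a half-cosine decay to 0.
    """
    if step < warmup_steps:
        # Linear warmup phase.
        return base_lr * step / max(1, warmup_steps)
    # Fraction of the decay phase completed, in [0, 1].
    progress = (step - warmup_steps) / max(1, total_steps - warmup_steps)
    # Half-cosine decay, clamped at zero.
    return base_lr * max(0.0, 0.5 * (1.0 + math.cos(math.pi * progress)))


if __name__ == "__main__":
    # Print the rate at each evaluation step listed in the results table.
    for step in range(0, TRAINING_STEPS + 1, 25):
        print(f"step {step:3d}  lr {cosine_lr(step):.2e}")
```

Under these assumptions the rate peaks at step 25, the first evaluation point in the table, and decays toward zero by step 275. This is consistent with the validation loss flattening at 0.1863 over the last two evaluations, though the card does not state that the schedule is the cause.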