End of training
Browse files- README.md +8 -0
- model.safetensors +1 -1
README.md
CHANGED
|
@@ -14,6 +14,8 @@ should probably proofread and complete it, then remove this comment. -->
|
|
| 14 |
# codeparrot-ds
|
| 15 |
|
| 16 |
This model is a fine-tuned version of [gpt2](https://huggingface.co/gpt2) on an unknown dataset.
|
|
|
|
|
|
|
| 17 |
|
| 18 |
## Model description
|
| 19 |
|
|
@@ -46,6 +48,12 @@ The following hyperparameters were used during training:
|
|
| 46 |
|
| 47 |
### Training results
|
| 48 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 49 |
|
| 50 |
|
| 51 |
### Framework versions
|
|
|
|
| 14 |
# codeparrot-ds
|
| 15 |
|
| 16 |
This model is a fine-tuned version of [gpt2](https://huggingface.co/gpt2) on an unknown dataset.
|
| 17 |
+
It achieves the following results on the evaluation set:
|
| 18 |
+
- Loss: 1.0896
|
| 19 |
|
| 20 |
## Model description
|
| 21 |
|
|
|
|
| 48 |
|
| 49 |
### Training results
|
| 50 |
|
| 51 |
+
| Training Loss | Epoch | Step | Validation Loss |
|
| 52 |
+
|:-------------:|:-----:|:-----:|:---------------:|
|
| 53 |
+
| 1.4935 | 0.23 | 5000 | 1.4177 |
|
| 54 |
+
| 1.3089 | 0.46 | 10000 | 1.2413 |
|
| 55 |
+
| 1.2055 | 0.69 | 15000 | 1.1374 |
|
| 56 |
+
| 1.1502 | 0.92 | 20000 | 1.0896 |
|
| 57 |
|
| 58 |
|
| 59 |
### Framework versions
|
model.safetensors
CHANGED
|
@@ -1,3 +1,3 @@
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:
|
| 3 |
size 496984704
|
|
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:e6f50fb76fb43f99a0eaff3cd74bb85a910bb5b8f681518ef42cdd9f34d7f898
|
| 3 |
size 496984704
|