june
#1
by
ablam
- opened
- README.md +1 -55
- pytorch_model.bin +1 -1
- runs/Jun10_18-26-32_hal10/1654904530.7637877/events.out.tfevents.1654904530.hal10.596337.1 +0 -3
- runs/Jun10_18-26-32_hal10/events.out.tfevents.1654904530.hal10.596337.0 +0 -3
- runs/Jun10_20-14-03_hal01/1654910111.593988/events.out.tfevents.1654910111.hal01.3050200.1 +0 -3
- runs/Jun10_20-14-03_hal01/events.out.tfevents.1654910111.hal01.3050200.0 +0 -3
- runs/Jun10_20-14-03_hal01/events.out.tfevents.1654918283.hal01.3050200.2 +0 -3
- runs/May13_16-56-59_hal11/events.out.tfevents.1652479135.hal11.132641.0 +2 -2
- runs/May13_16-56-59_hal11/events.out.tfevents.1652483113.hal11.132641.2 +0 -3
- training_args.bin +1 -1
README.md
CHANGED
|
@@ -1,55 +1 @@
|
|
| 1 |
-
|
| 2 |
-
tags:
|
| 3 |
-
- generated_from_trainer
|
| 4 |
-
model-index:
|
| 5 |
-
- name: distilgpt2_fine_tuned_gcode
|
| 6 |
-
results: []
|
| 7 |
-
---
|
| 8 |
-
|
| 9 |
-
<!-- This model card has been generated automatically according to the information the Trainer had access to. You
|
| 10 |
-
should probably proofread and complete it, then remove this comment. -->
|
| 11 |
-
|
| 12 |
-
# distilgpt2_fine_tuned_gcode
|
| 13 |
-
|
| 14 |
-
This model is a fine-tuned version of [congcongwang/distilgpt2_fine_tuned_coder](https://huggingface.co/congcongwang/distilgpt2_fine_tuned_coder) on the None dataset.
|
| 15 |
-
It achieves the following results on the evaluation set:
|
| 16 |
-
- Loss: 4.1670
|
| 17 |
-
|
| 18 |
-
## Model description
|
| 19 |
-
|
| 20 |
-
More information needed
|
| 21 |
-
|
| 22 |
-
## Intended uses & limitations
|
| 23 |
-
|
| 24 |
-
More information needed
|
| 25 |
-
|
| 26 |
-
## Training and evaluation data
|
| 27 |
-
|
| 28 |
-
More information needed
|
| 29 |
-
|
| 30 |
-
## Training procedure
|
| 31 |
-
|
| 32 |
-
### Training hyperparameters
|
| 33 |
-
|
| 34 |
-
The following hyperparameters were used during training:
|
| 35 |
-
- learning_rate: 0.1
|
| 36 |
-
- train_batch_size: 8
|
| 37 |
-
- eval_batch_size: 8
|
| 38 |
-
- seed: 42
|
| 39 |
-
- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
|
| 40 |
-
- lr_scheduler_type: linear
|
| 41 |
-
- num_epochs: 1
|
| 42 |
-
|
| 43 |
-
### Training results
|
| 44 |
-
|
| 45 |
-
| Training Loss | Epoch | Step | Validation Loss |
|
| 46 |
-
|:-------------:|:-----:|:-----:|:---------------:|
|
| 47 |
-
| 4.1754 | 1.0 | 52144 | 4.1670 |
|
| 48 |
-
|
| 49 |
-
|
| 50 |
-
### Framework versions
|
| 51 |
-
|
| 52 |
-
- Transformers 4.15.0
|
| 53 |
-
- Pytorch 1.10.1
|
| 54 |
-
- Datasets 2.1.0
|
| 55 |
-
- Tokenizers 0.10.3
|
|
|
|
| 1 |
+
Sample generation of Gcode via finetuned code. Work in progress.
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
pytorch_model.bin
CHANGED
|
@@ -1,3 +1,3 @@
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:
|
| 3 |
size 333982173
|
|
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:bdd79b5539199214da6cc28d77c138f96d5f14cfed70e0c25d893314c4be5659
|
| 3 |
size 333982173
|
runs/Jun10_18-26-32_hal10/1654904530.7637877/events.out.tfevents.1654904530.hal10.596337.1
DELETED
|
@@ -1,3 +0,0 @@
|
|
| 1 |
-
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:d009936d6f466d546b5495b63f19ad1c83e81d9d2d9aa9ca4b382c9a83b34eb7
|
| 3 |
-
size 4761
|
|
|
|
|
|
|
|
|
|
|
|
runs/Jun10_18-26-32_hal10/events.out.tfevents.1654904530.hal10.596337.0
DELETED
|
@@ -1,3 +0,0 @@
|
|
| 1 |
-
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:a45bb5af26c45a5028bc95109d974ca586e63f276f282c669f4928908e57f546
|
| 3 |
-
size 13024
|
|
|
|
|
|
|
|
|
|
|
|
runs/Jun10_20-14-03_hal01/1654910111.593988/events.out.tfevents.1654910111.hal01.3050200.1
DELETED
|
@@ -1,3 +0,0 @@
|
|
| 1 |
-
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:53f75409c31341e3fe11d1f0f050ea6fe826b4d3db0f832f80d7e9ab32d011e6
|
| 3 |
-
size 4761
|
|
|
|
|
|
|
|
|
|
|
|
runs/Jun10_20-14-03_hal01/events.out.tfevents.1654910111.hal01.3050200.0
DELETED
|
@@ -1,3 +0,0 @@
|
|
| 1 |
-
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:1109e9e486c280a41e1e6f419b8dbf0a1dae7e3c5aba71a767464de143eee211
|
| 3 |
-
size 20860
|
|
|
|
|
|
|
|
|
|
|
|
runs/Jun10_20-14-03_hal01/events.out.tfevents.1654918283.hal01.3050200.2
DELETED
|
@@ -1,3 +0,0 @@
|
|
| 1 |
-
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:d8ab8c077702cfdcc33fec8b586f42514850d1c5780aa0bb1168869ef8a0f45a
|
| 3 |
-
size 316
|
|
|
|
|
|
|
|
|
|
|
|
runs/May13_16-56-59_hal11/events.out.tfevents.1652479135.hal11.132641.0
CHANGED
|
@@ -1,3 +1,3 @@
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:
|
| 3 |
-
size
|
|
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:8c9fae8c53b0d7f5e9f7ab3503f38e5f7123235b8cb65722010d17d11fd751c4
|
| 3 |
+
size 7134
|
runs/May13_16-56-59_hal11/events.out.tfevents.1652483113.hal11.132641.2
DELETED
|
@@ -1,3 +0,0 @@
|
|
| 1 |
-
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:ec5daeddf5bf28457cdd7b9c888c1281449542f9622d435cb7eb6b69d23d4ec3
|
| 3 |
-
size 311
|
|
|
|
|
|
|
|
|
|
|
|
training_args.bin
CHANGED
|
@@ -1,3 +1,3 @@
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:
|
| 3 |
size 2991
|
|
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:3141863ff65d675f6e0232273df116411d0be95d45b4453b6a71ea8932bc7298
|
| 3 |
size 2991
|