fab-an committed on
Commit ffd3c3b · 1 Parent(s): 0f22d21

Training in progress epoch 0

Files changed (3)
  1. README.md +5 -8
  2. config.json +1 -1
  3. tf_model.h5 +1 -1
README.md CHANGED
@@ -1,5 +1,4 @@
  ---
- license: apache-2.0
  tags:
  - generated_from_keras_callback
  model-index:
@@ -12,11 +11,11 @@ probably proofread and complete it, then remove this comment. -->

  # fab-an/my_lang-model

- This model is a fine-tuned version of [distilgpt2](https://huggingface.co/distilgpt2) on an unknown dataset.
+ This model was trained from scratch on an unknown dataset.
  It achieves the following results on the evaluation set:
- - Train Loss: 6.9749
- - Validation Loss: 6.8068
- - Epoch: 2
+ - Train Loss: 6.9373
+ - Validation Loss: 6.5509
+ - Epoch: 0

  ## Model description

@@ -42,9 +41,7 @@ The following hyperparameters were used during training:

  | Train Loss | Validation Loss | Epoch |
  |:----------:|:---------------:|:-----:|
- | 7.3639 | 7.0367 | 0 |
- | 7.1125 | 6.9146 | 1 |
- | 6.9749 | 6.8068 | 2 |
+ | 6.9373 | 6.5509 | 0 |


  ### Framework versions
config.json CHANGED
@@ -1,5 +1,5 @@
  {
- "_name_or_path": "distilgpt2",
+ "_name_or_path": "my_lang-model",
  "_num_labels": 1,
  "activation_function": "gelu_new",
  "architectures": [
tf_model.h5 CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:405a7ea0effd9fca82e4fdb0be0b17e9cb534c8d84f7509b8d3a9410c70647f4
+ oid sha256:559c2ca95d41946348d0476e08815da71b9ed2437786f3fc1b36fefce07bbe49
  size 327745496
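
The tf_model.h5 entry above is a Git LFS pointer, not the weights themselves: it records a spec version, a SHA-256 object id, and a byte size. As a minimal sketch, assuming you have the real file bytes on hand and want to check them against the pointer, something like the following could be used. The names `parse_lfs_pointer` and `matches_pointer` are hypothetical helpers for illustration, not part of Git LFS or any Hugging Face library.

```python
import hashlib

# Pointer text copied from the new side of the tf_model.h5 diff above.
NEW_POINTER = """version https://git-lfs.github.com/spec/v1
oid sha256:559c2ca95d41946348d0476e08815da71b9ed2437786f3fc1b36fefce07bbe49
size 327745496
"""


def parse_lfs_pointer(text: str) -> dict:
    """Split a pointer file into its version, hash algorithm, digest, and size."""
    fields = {}
    for line in text.strip().splitlines():
        # Each line is "<key> <value>"; split on the first space only.
        key, _, value = line.strip().partition(" ")
        fields[key] = value
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "hash_algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(data: bytes, pointer: dict) -> bool:
    """Return True if the bytes have the size and digest the pointer records."""
    if len(data) != pointer["size"]:
        return False
    return hashlib.new(pointer["hash_algo"], data).hexdigest() == pointer["digest"]


if __name__ == "__main__":
    pointer = parse_lfs_pointer(NEW_POINTER)
    print(pointer["hash_algo"], pointer["size"])
```

Since the size is identical on both sides of the diff (327745496 bytes), only the digest distinguishes the old weights from the new ones, which is why the sketch compares both fields.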