bigmorning
/

try-m

Text Generation

generated_from_keras_callback

Model card Files Files and versions

bigmorning commited on Mar 25, 2022

Commit

4d13d80

·

1 Parent(s): b36e2f7

add model

Files changed (3) hide show

README.md +1 -6
config.json +1 -1
tf_model.h5 +2 -2

README.md CHANGED Viewed

@@ -14,8 +14,7 @@ probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [distilgpt2](https://huggingface.co/distilgpt2) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Train Loss: 5.5751
-- Epoch: 1
 ## Model description
@@ -39,10 +38,6 @@ The following hyperparameters were used during training:
 ### Training results
-| Train Loss | Epoch |
-|:----------:|:-----:|
-| 6.0979     | 0     |
-| 5.5751     | 1     |
 ### Framework versions

 This model is a fine-tuned version of [distilgpt2](https://huggingface.co/distilgpt2) on an unknown dataset.
 It achieves the following results on the evaluation set:
 ## Model description
 ### Training results
 ### Framework versions

config.json CHANGED Viewed

@@ -41,5 +41,5 @@
   },
   "transformers_version": "4.17.0",
   "use_cache": false,
-  "vocab_size": 50257
 }

   },
   "transformers_version": "4.17.0",
   "use_cache": false,
+  "vocab_size": 5998
 }

tf_model.h5 CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:aedebfa3bfb10faca2e81c7e205208ab122a761d79024cc8b52e755236057d83
-size 327745496

 version https://git-lfs.github.com/spec/v1
+oid sha256:969d73e8cf6370458f993597ecf04ca2347a47d44ceb1cf5df0511f8cdfcf213
+size 210211336