manu
/

llama-wikitext

Text Generation

Generated from Trainer

text-generation-inference

Model card Files Files and versions

manu commited on Oct 28, 2023

Commit

b084521

·

1 Parent(s): d306da3

Model save

Files changed (2) hide show

README.md +2 -4
pytorch_model.bin +1 -1

README.md CHANGED Viewed

@@ -2,8 +2,6 @@
 base_model: mock_training_run/llama_configs/config.json
 tags:
 - generated_from_trainer
-datasets:
-- wikitext
 model-index:
 - name: llama-wikitext
   results: []
@@ -14,7 +12,7 @@ should probably proofread and complete it, then remove this comment. -->
 # llama-wikitext
-This model is a fine-tuned version of [mock_training_run/llama_configs/config.json](https://huggingface.co/mock_training_run/llama_configs/config.json) on the wikitext wikitext-103-v1 dataset.
 ## Model description
@@ -45,7 +43,7 @@ The following hyperparameters were used during training:
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
 - lr_scheduler_warmup_ratio: 0.05
-- num_epochs: 1.0
 ### Training results

 base_model: mock_training_run/llama_configs/config.json
 tags:
 - generated_from_trainer
 model-index:
 - name: llama-wikitext
   results: []
 # llama-wikitext
+This model is a fine-tuned version of [mock_training_run/llama_configs/config.json](https://huggingface.co/mock_training_run/llama_configs/config.json) on the None dataset.
 ## Model description
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
 - lr_scheduler_warmup_ratio: 0.05
+- num_epochs: 3.0
 ### Training results

pytorch_model.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:90bc4ab9bd0bb0e0ee337717bc950961694a7b4709743ca45aab98c677406fba
 size 879736457

 version https://git-lfs.github.com/spec/v1
+oid sha256:77021472c51c97b6dff39d679e5e6e9a17d3aec2828014c2ef5abc3bf996f31c
 size 879736457