pszemraj
/

bart-base-instructiongen-LongForm

text2text-generation

Generated from Trainer

instruction generation

prompt-generation

Model card Files Files and versions

pszemraj commited on May 15, 2023

Commit

b476028

·

1 Parent(s): 31c8d92

Update README.md

Files changed (1) hide show

README.md +10 -11

README.md CHANGED Viewed

@@ -76,8 +76,18 @@ inference:
 # bart-base-instructiongen + LongForm
 This model is a fine-tuned version of [pszemraj/bart-base-instructiongen](https://huggingface.co/pszemraj/bart-base-instructiongen) on the `akoksal/LongForm` dataset.
 ## Training procedure
 ### Training hyperparameters
@@ -94,14 +104,3 @@ The following hyperparameters were used during training:
 - lr_scheduler_type: cosine
 - lr_scheduler_warmup_ratio: 0.02
 - num_epochs: 3.0
-### Training results
-### Framework versions
-- Transformers 4.29.0.dev0
-- Pytorch 2.0.1+cu117
-- Datasets 2.12.0
-- Tokenizers 0.13.3

 # bart-base-instructiongen + LongForm
+Instead of generating questions from text, generate instructions for LLMs!
+- Check out a [basic demo on Spaces](https://huggingface.co/spaces/pszemraj/generate-instructions)
+- An example of how to use instructiongen models in a CLI script can be found [here](https://gist.github.com/pszemraj/8b0213e700763106074d3ac15d041c14)
+- You can find other models fine-tuned for instruction generation by [searching for the instructiongen tag](https://huggingface.co/models?other=instructiongen).
+## about
 This model is a fine-tuned version of [pszemraj/bart-base-instructiongen](https://huggingface.co/pszemraj/bart-base-instructiongen) on the `akoksal/LongForm` dataset.
+This was trained on a dataset of **only** instructions+outputs, with any `inputs` filtered out. This means that text of *1) cookies and cream 2) chocolate chip 3) mint chip 4) oreo* will **not** get you *"Rank the following ice cream flavors: oreo, mint chip, chocolate chip, cookies and cream"*.
 ## Training procedure
 ### Training hyperparameters
 - lr_scheduler_type: cosine
 - lr_scheduler_warmup_ratio: 0.02
 - num_epochs: 3.0