Seahorse-350m

Model description

This is the first generation of a OPT based model, finetuned on the Orca dataset formatted to the Alpaca style.

Training data

psmathur/alpaca_orca

How to use

You can use this model directly with a pipeline for text generation. This example generates a different sequence each time it's run:

>>> from transformers import pipeline
>>> generator = pipeline('text-generation', model='TitleOS/Seahorse-350m')
>>> generator("Tell me about Alpacas.", do_sample=True, min_length=50)

Limitations and biases

Based on known problems with NLP technology, potential relevant factors include bias (gender, profession, race and religion).

License

BibTeX entry and citation info

@misc{zhang2022opt,
      title={OPT: Open Pre-trained Transformer Language Models}, 
      author={Susan Zhang and Stephen Roller and Naman Goyal and Mikel Artetxe and Moya Chen and Shuohui Chen and Christopher Dewan and Mona Diab and Xian Li and Xi Victoria Lin and Todor Mihaylov and Myle Ott and Sam Shleifer and Kurt Shuster and Daniel Simig and Punit Singh Koura and Anjali Sridhar and Tianlu Wang and Luke Zettlemoyer},
      year={2022},
      eprint={2205.01068},
      archivePrefix={arXiv},
      primaryClass={cs.CL}
}

Downloads last month: 9

Safetensors

Model size

0.4B params

Tensor type

F32

Dataset used to train TitleOS/Seahorse-350m

Paper for TitleOS/Seahorse-350m

OPT: Open Pre-trained Transformer Language Models

Paper • 2205.01068 • Published May 2, 2022 • 2