Commit ·
3ab1f76
1
Parent(s): 740c5e6
Update README.md
Browse files
README.md
CHANGED
|
@@ -14,6 +14,8 @@ tags:
|
|
| 14 |
|
| 15 |
This is a text generation model based on the [OPT-1.3B](https://huggingface.co/facebook/opt-1.3b) model from Meta, trained using the Deepspeed library. The model can generate natural and engaging conversational responses given a user input.
|
| 16 |
|
|
|
|
|
|
|
| 17 |
## Training Details
|
| 18 |
|
| 19 |
- The base model is [OPT-1.3B](https://huggingface.co/facebook/opt-1.3b), a decoder-only transformer with 1.3 billion parameters, pre-trained on a large text corpus using the causal language modeling objective.
|
|
@@ -39,6 +41,15 @@ generator = pipeline('text-generation', model='DarwinAnim8or/OPT-1.3b-Chat')
|
|
| 39 |
generator("Hello, how are you?")
|
| 40 |
```
|
| 41 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 42 |
## License
|
| 43 |
This model is licensed under the [OPT-175B license](https://github.com/facebookresearch/metaseq/blob/main/projects/OPT/MODEL_LICENSE.md), which is a non-commercial research license. Please read the full license terms before using this model.
|
| 44 |
|
|
|
|
| 14 |
|
| 15 |
This is a text generation model based on the [OPT-1.3B](https://huggingface.co/facebook/opt-1.3b) model from Meta, trained using the Deepspeed library. The model can generate natural and engaging conversational responses given a user input.
|
| 16 |
|
| 17 |
+
A Demo is [available here](https://huggingface.co/spaces/KoalaAI/OPT-Chat)
|
| 18 |
+
|
| 19 |
## Training Details
|
| 20 |
|
| 21 |
- The base model is [OPT-1.3B](https://huggingface.co/facebook/opt-1.3b), a decoder-only transformer with 1.3 billion parameters, pre-trained on a large text corpus using the causal language modeling objective.
|
|
|
|
| 41 |
generator("Hello, how are you?")
|
| 42 |
```
|
| 43 |
|
| 44 |
+
### Suggested formatting
|
| 45 |
+
The training data uses the following format:
|
| 46 |
+
```
|
| 47 |
+
Human: <question>
|
| 48 |
+
Assistant: <answer>
|
| 49 |
+
```
|
| 50 |
+
|
| 51 |
+
It is recommended to follow the same format as closely as possible for the best results.
|
| 52 |
+
|
| 53 |
## License
|
| 54 |
This model is licensed under the [OPT-175B license](https://github.com/facebookresearch/metaseq/blob/main/projects/OPT/MODEL_LICENSE.md), which is a non-commercial research license. Please read the full license terms before using this model.
|
| 55 |
|