Commit 3edda12
Parent(s): 9016b1f
Update README.md
README.md CHANGED
@@ -52,9 +52,9 @@ To load Eluwa, download [OPT 2.7b from Huggingface](https://huggingface.co/faceb
 
 ## Training and notes
 
-Training Eluwa is a straightforward process. It is essentially Facebook's GPT-like OPT 2.7b model, loaded in 8-bit and trained using [Stanford's Alapaca dataset](https://github.com/tatsu-lab/stanford_alpaca).
-
-
+Training Eluwa is a straightforward process. It is essentially Facebook's GPT-like OPT 2.7b model, loaded in 8-bit and trained using [Stanford's Alapaca dataset](https://github.com/tatsu-lab/stanford_alpaca).
+Use the [Colab notebook here](https://huggingface.co/BackyardLabs/Eluwa/blob/main/Train_eluwa.ipynb). I've written notes in there on what the functions do.
+
 
 ## Why "Eluwa"?
 
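
Note on the README text in this change: it says the base model is "loaded in 8-bit" but shows no loading code. A minimal sketch of what that step might look like with the `transformers` library follows. It is not taken from the linked Colab notebook. The model id `facebook/opt-2.7b` is an assumption (the link in the hunk header is truncated), the helper name `load_base_model_8bit` is hypothetical, and 8-bit loading additionally assumes that `bitsandbytes`, `accelerate`, and a CUDA GPU are available.

```python
# Assumption: Hub id of the OPT 2.7b base model; the diff truncates the real link.
MODEL_ID = "facebook/opt-2.7b"


def load_base_model_8bit(model_id: str = MODEL_ID):
    """Hypothetical helper: load a causal LM with 8-bit weights.

    Imports are done lazily so that defining this function does not
    require transformers to be installed.
    """
    from transformers import (
        AutoModelForCausalLM,
        AutoTokenizer,
        BitsAndBytesConfig,
    )

    # Ask transformers to quantize the linear layers to 8-bit via bitsandbytes.
    quant_config = BitsAndBytesConfig(load_in_8bit=True)

    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForCausalLM.from_pretrained(
        model_id,
        quantization_config=quant_config,
        device_map="auto",  # let accelerate place layers on available devices
    )
    return model, tokenizer
```

Calling `model, tokenizer = load_base_model_8bit()` downloads the weights on first use. The fine-tuning step on the Alpaca data is not sketched here; the notebook referenced in the README is the place to look for it.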