Instructions to use monsterapi/gpt2_137m_OpenPlatypus with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- PEFT
How to use monsterapi/gpt2_137m_OpenPlatypus with PEFT:
from peft import PeftModel from transformers import AutoModelForCausalLM base_model = AutoModelForCausalLM.from_pretrained("gpt2") model = PeftModel.from_pretrained(base_model, "monsterapi/gpt2_137m_OpenPlatypus") - Notebooks
- Google Colab
- Kaggle
Update README.md
#1
by souvik0306 - opened
README.md
CHANGED
|
@@ -1,9 +1,46 @@
|
|
| 1 |
---
|
| 2 |
library_name: peft
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 3 |
---
|
| 4 |
-
## Training procedure
|
| 5 |
|
| 6 |
-
###
|
| 7 |
|
|
|
|
| 8 |
|
| 9 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
---
|
| 2 |
library_name: peft
|
| 3 |
+
tags:
|
| 4 |
+
- code
|
| 5 |
+
- gpt2
|
| 6 |
+
datasets:
|
| 7 |
+
- garage-bAInd/Open-Platypus
|
| 8 |
+
base_model: gpt2
|
| 9 |
+
license: apache-2.0
|
| 10 |
---
|
|
|
|
| 11 |
|
| 12 |
+
### Finetuning Overview:
|
| 13 |
|
| 14 |
+
**Model Used:** gpt2
|
| 15 |
|
| 16 |
+
**Dataset:** garage-bAInd/Open-Platypus
|
| 17 |
+
|
| 18 |
+
#### Dataset Insights:
|
| 19 |
+
|
| 20 |
+
[garage-bAInd/Open-Platypus](https://huggingface.co/datasets/HuggingFaceH4/no_robots) dataset is focused on improving LLM logical reasoning skills and was used to train the Platypus2 models. It is comprised of the following datasets, which were filtered using keyword search and then Sentence Transformers to remove questions with a similarity above 80%
|
| 21 |
+
|
| 22 |
+
#### Finetuning Details:
|
| 23 |
+
|
| 24 |
+
With the utilization of [MonsterAPI](https://monsterapi.ai)'s [LLM finetuner](https://docs.monsterapi.ai/fine-tune-a-large-language-model-llm), this finetuning:
|
| 25 |
+
|
| 26 |
+
- Was achieved with great cost-effectiveness.
|
| 27 |
+
- Completed in a total duration of 5m 18s for 1 epoch using an A6000 48GB GPU.
|
| 28 |
+
- Costed `$0.168` for the entire epoch.
|
| 29 |
+
|
| 30 |
+
#### Hyperparameters & Additional Details:
|
| 31 |
+
|
| 32 |
+
- **Epochs:** 1
|
| 33 |
+
- **Cost Per Epoch:** $0.168
|
| 34 |
+
- **Total Finetuning Cost:** $0.168
|
| 35 |
+
- **Model Path:** gpt2
|
| 36 |
+
- **Learning Rate:** 0.0002
|
| 37 |
+
- **Data Split:** 100% train
|
| 38 |
+
- **Gradient Accumulation Steps:** 4
|
| 39 |
+
- **lora r:** 32
|
| 40 |
+
- **lora alpha:** 64
|
| 41 |
+
|
| 42 |
+
#### Train loss :
|
| 43 |
+
|
| 44 |
+

|
| 45 |
+
|
| 46 |
+
license: apache-2.0
|