max_new_tokens: 1024
---
# QLoRA weights using Llama-2-7b for the Code Alpaca Dataset
# Fine-Tuning on Predibase
This model was fine-tuned using [Predibase](https://predibase.com/), the first low-code AI platform for engineers.
I fine-tuned base Llama-2-7b using LoRA with 4-bit quantization on a single T4 GPU, which cost approximately $3 to train.
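To see why a $3 fine-tune on a single T4 is plausible, it helps to compare trainable-parameter counts with and without LoRA. The helper below is purely illustrative (the 4096-dimensional layer and rank 8 are assumed example values, not the settings of this run):

```python
# LoRA trains a low-rank update delta_W = B @ A instead of the full weight:
# B is d x r and A is r x k, with rank r much smaller than d and k.
def lora_param_counts(d: int, k: int, r: int) -> tuple[int, int]:
    full = d * k               # trainable params if the whole matrix were updated
    low_rank = d * r + r * k   # trainable params for the two LoRA factors
    return full, low_rank

# Illustrative Llama-2-like attention projection (4096 x 4096) at rank 8:
full, low_rank = lora_param_counts(4096, 4096, 8)
print(f"{full:,} vs {low_rank:,} trainable parameters")  # 16,777,216 vs 65,536
```

At rank 8 the adapter trains roughly 256x fewer parameters per weight matrix, which (together with 4-bit quantization of the frozen base weights) is what keeps the job within a single T4's memory.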

Dataset and training parameters are borrowed from: https://github.com/sahil28011
but all of these parameters, including DeepSpeed, can be used directly with [Ludwig](https://ludwig.ai/latest/), the open-source
toolkit for LLMs that Predibase is built on.
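As a sketch of what that looks like, a Ludwig-style declarative config for this kind of QLoRA run is shown below. The exact keys, feature names, and hyperparameter values are assumptions for illustration only, not the configuration used to train this model:

```yaml
# Illustrative Ludwig-style config: fine-tune Llama-2-7b with a LoRA adapter
# and 4-bit quantization of the frozen base weights.
model_type: llm
base_model: meta-llama/Llama-2-7b-hf
quantization:
  bits: 4
adapter:
  type: lora
input_features:
  - name: instruction
    type: text
output_features:
  - name: output
    type: text
trainer:
  type: finetune
  epochs: 3
  learning_rate: 0.0001
```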
# How To Use The Model
To use these weights:
```python
from peft import PeftModel, PeftConfig