kashif
/

stack-llama-2

Text Generation

text-generation-inference

Model card Files Files and versions

kashif HF Staff commited on Aug 8, 2023

Commit

28a2066

·

1 Parent(s): 0cf5c13

Update README.md

Files changed (1) hide show

README.md +1 -2

README.md CHANGED Viewed

@@ -54,8 +54,7 @@ Fine-tuning datasets for this model are based on [Stack Exchange Paired](https:/
 **DPO Training:** [https://huggingface.co/datasets/lvwerra/stack-exchange-paired/tree/main/data/rl](https://huggingface.co/datasets/lvwerra/stack-exchange-paired/tree/main/data/rl)
 ### Training Procedure
-The model was first fine-tuned on the Stack Exchange question and answer pairs and then fine-tuned via the DPO training procedure using the SFT model as the reference model.
-It is trained to respond to prompts with the following template:
 ```
 Question: <Query>

 **DPO Training:** [https://huggingface.co/datasets/lvwerra/stack-exchange-paired/tree/main/data/rl](https://huggingface.co/datasets/lvwerra/stack-exchange-paired/tree/main/data/rl)
 ### Training Procedure
+The model was first fine-tuned on the Stack Exchange question and answer pairs and then fine-tuned via the DPO training procedure using the SFT model as the reference model.   It is trained to respond to prompts with the following prompt template:
 ```
 Question: <Query>