GPRM
/

Llama-2-7b-OpenAssistant-Backwards

Model card Files Files and versions

Llama-2-7b-OpenAssistant-Backwards / README.md

GPRM's picture

Update README.md

4fde0fb verified 10 months ago

|

history blame contribute delete

389 Bytes

	A straightforward implementation of the Backward Model Myx, inspired by the paper "Self-Alignment with Instruction Backtranslation."

	This model was fine-tuned on LLaMA-2-hf using (output, instruction) pairs {(yi, xi)} from the OpenAssistant-Guanaco training dataset.

	The fine-tuning process was conducted using LoRA, and the uploaded model is provided in its merged form.