LLama_RLHF_SO / README.md
Myashka's picture
Update README.md
c089dc7
metadata
library_name: peft
base_model: decapoda-research/llama-7b-hf

Training procedure

RLHF training upon Myashka/LLama_SO_SFT SFT LoRA adapter