library_name: peft
base_model: decapoda-research/llama-7b-hf
RLHF training on top of the Myashka/LLama_SO_SFT SFT LoRA adapter.
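
As a rough illustration of the setup described above, the following sketch attaches the SFT LoRA adapter to the base model with `peft`. The helper name `load_sft_model` is hypothetical, and the sketch assumes both repositories are reachable on the Hub and that the tokenizer loads with `LlamaTokenizer`; the card confirms neither. It does not load the RLHF-trained weights themselves, since their repository id is not stated here.

```python
BASE_MODEL_ID = "decapoda-research/llama-7b-hf"  # from the card metadata
SFT_ADAPTER_ID = "Myashka/LLama_SO_SFT"          # SFT LoRA adapter named in the card


def load_sft_model(base_id=BASE_MODEL_ID, adapter_id=SFT_ADAPTER_ID):
    """Hypothetical helper: load the base model and attach a LoRA adapter."""
    # Imported lazily so this file can be imported without the heavy dependencies.
    from transformers import AutoModelForCausalLM, LlamaTokenizer
    from peft import PeftModel

    # Assumption: the base repo's tokenizer is compatible with LlamaTokenizer.
    tokenizer = LlamaTokenizer.from_pretrained(base_id)
    base = AutoModelForCausalLM.from_pretrained(base_id)

    # Wrap the base model with the LoRA weights from the adapter repo.
    model = PeftModel.from_pretrained(base, adapter_id)
    return model, tokenizer
```

Calling `load_sft_model()` downloads the full 7B base weights, so it is best run on a machine with enough memory and disk space for them.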