LLama_RLHF_SO / README.md
Myashka's picture
Update README.md
c089dc7
---
library_name: peft
base_model: decapoda-research/llama-7b-hf
---
## Training procedure
RLHF training upon Myashka/LLama_SO_SFT SFT LoRA adapter