| library_name: peft | |
| base_model: decapoda-research/llama-7b-hf | |
| ## Training procedure | |
| RLHF training on top of the Myashka/LLama_SO_SFT SFT LoRA adapter |
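
The starting point described above (the base model with the SFT LoRA adapter attached and left trainable) might be set up roughly as follows. This is a minimal sketch, not the author's training script: the card does not say which RLHF algorithm or library was used, so only the loading step is shown. The function name `load_sft_policy` is hypothetical, and the code assumes `transformers` and `peft` are installed and that both repositories are reachable.

```python
# Identifiers taken from the model card.
BASE_MODEL = "decapoda-research/llama-7b-hf"
SFT_ADAPTER = "Myashka/LLama_SO_SFT"


def load_sft_policy(base_model: str = BASE_MODEL, adapter: str = SFT_ADAPTER):
    """Hypothetical helper: attach the SFT LoRA adapter to the base model.

    Imports are deferred so the module can be imported without the
    heavy dependencies installed.
    """
    from transformers import AutoModelForCausalLM
    from peft import PeftModel

    # Load the frozen base weights.
    base = AutoModelForCausalLM.from_pretrained(base_model)

    # Attach the SFT adapter; is_trainable=True keeps the LoRA weights
    # trainable so a later RLHF stage can continue updating them.
    return PeftModel.from_pretrained(base, adapter, is_trainable=True)
```

Calling `load_sft_policy()` downloads the weights, so it is best run on a machine with enough memory for a 7B-parameter model.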