Mistral-7B-v0.1-DPO is a finetuned adapter of the original Mistral-7B model. In this adapter, I finetune the LM head in addition to the modules that are normally finetuned. The full list of finetuned modules is: `k_proj`, `gate_proj`, `v_proj`, `up_proj`, `q_proj`, `o_proj`, `down_proj`, `lm_head`.
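
For illustration, here is a minimal PEFT sketch of a LoRA configuration targeting that module list. The rank, alpha, and dropout values are assumptions on my part; the card does not state the actual training hyperparameters.

```python
from peft import LoraConfig

# LoRA config covering the modules listed above, including lm_head.
# r, lora_alpha, and lora_dropout are placeholder values; the actual
# hyperparameters used to train this adapter are not given in the card.
config = LoraConfig(
    r=16,
    lora_alpha=32,
    lora_dropout=0.05,
    target_modules=[
        "k_proj", "gate_proj", "v_proj", "up_proj",
        "q_proj", "o_proj", "down_proj", "lm_head",
    ],
    bias="none",
    task_type="CAUSAL_LM",
)
```

Including `lm_head` is the notable difference here: common LoRA setups for Mistral target only the attention and MLP projections and leave the output head frozen.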
