license: llama3

LLaMA-3-8B-SFR-Iterative-DPO-Concise-R

This is a more concise variant of Salesforce/SFR-Iterative-DPO-LLaMA-3-8B-R. During training, a conciseness penalty was applied.
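This card does not state how the penalty is computed. As a rough illustration only, one common way to encourage shorter answers in preference training is to subtract a length-proportional term from the reward before ranking candidate responses. The sketch below assumes that form; the function names, the `penalty_per_token` value, and the candidate tuples are all hypothetical and are not taken from the actual training code.

```python
# Illustrative sketch only: the real penalty used for this model is not documented here.
# All names and the penalty coefficient below are hypothetical.

def penalized_reward(reward: float, response_tokens: int,
                     penalty_per_token: float = 0.001) -> float:
    """Raw reward minus a cost proportional to response length (assumed form)."""
    return reward - penalty_per_token * response_tokens


def pick_preference_pair(candidates):
    """Pick (chosen, rejected) texts from (text, reward, n_tokens) tuples.

    The candidate with the highest penalized reward is 'chosen' and the one
    with the lowest is 'rejected', as in a typical DPO pair construction.
    """
    ranked = sorted(candidates, key=lambda c: penalized_reward(c[1], c[2]))
    return ranked[-1][0], ranked[0][0]


# A slightly higher raw reward no longer wins if the response is much longer.
candidates = [("short answer", 1.00, 100), ("long answer", 1.05, 400)]
chosen, rejected = pick_preference_pair(candidates)
```

With a penalty of this kind, a longer response has to earn enough extra reward to offset its length, which pushes the preference data toward shorter answers.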