bpucla's picture
Update README.md
c0c2caa verified
|
raw
history blame
187 Bytes
---
license: llama3
---
# LLaMA-3-8B-SFR-Iterative-DPO-Concise-R
This is a concise version of Salesforce/SFR-Iterative-DPO-LLaMA-3-8B-R. In the training, a concise penalty is applied.