metadata
license: llama3
LLaMA-3-8B-SFR-Iterative-DPO-Concise-R
This is a concise version of Salesforce/SFR-Iterative-DPO-LLaMA-3-8B-R. In the training, a concise penalty is applied.
license: llama3
This is a concise version of Salesforce/SFR-Iterative-DPO-LLaMA-3-8B-R. In the training, a concise penalty is applied.