| license: llama3 | |
| # LLaMA-3-8B-SFR-Iterative-DPO-Concise-R | |
| This is a concise version of Salesforce/SFR-Iterative-DPO-LLaMA-3-8B-R. During training, a conciseness penalty is applied so that the model favors shorter responses. | |
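
This card does not say how the conciseness penalty is defined. The sketch below shows one common way such a penalty could work, purely as an illustration: a per-word cost is subtracted from a reward-model score before preference pairs are chosen for DPO. The function names, the word-count length measure, and the `penalty_per_word` value are all hypothetical and are not taken from the actual training recipe.

```python
from typing import List, Tuple


def length_penalized_reward(reward: float, response: str,
                            penalty_per_word: float = 0.001) -> float:
    """Subtract a linear length cost from a raw reward score.

    ASSUMPTION: a linear per-word penalty; the real penalty may differ.
    """
    num_words = len(response.split())
    return reward - penalty_per_word * num_words


def pick_preference_pair(candidates: List[Tuple[str, float]],
                         penalty_per_word: float = 0.001) -> Tuple[str, str]:
    """Return (chosen, rejected) responses ranked by penalized reward.

    `candidates` holds (response_text, raw_reward) tuples for one prompt.
    """
    scored = sorted(
        candidates,
        key=lambda c: length_penalized_reward(c[1], c[0], penalty_per_word),
    )
    # Highest penalized score is "chosen", lowest is "rejected".
    return scored[-1][0], scored[0][0]


if __name__ == "__main__":
    samples = [
        ("short answer", 1.0),
        ("a much longer answer " * 50, 1.02),
    ]
    chosen, rejected = pick_preference_pair(samples)
    print("chosen:", chosen[:40])
    print("rejected:", rejected[:40])
```

Under this assumed scheme, a slightly higher raw reward does not win if it comes from a much longer response, which is the behavior a conciseness penalty is meant to encourage.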