Salesforce
/

LLaMA-3-8B-SFR-SFT-R

Text Generation

text-generation-inference

Model card Files Files and versions

hendrydong commited on May 14, 2024

Commit

1734425

·

verified ·

1 Parent(s): f5da9fb

Update README.md

Files changed (1) hide show

README.md +5 -0

README.md CHANGED Viewed

@@ -4,6 +4,11 @@ license: cc-by-nc-nd-3.0
 # SFR-SFT-LLaMA-3-8B-R
 This is the SFT model for Salesforce/SFR-Iterative-DPO-LLaMA-3-8B-R.
 ## Citation
 Please cite our techical report if you find our model is useful for your research or product.

 # SFR-SFT-LLaMA-3-8B-R
 This is the SFT model for Salesforce/SFR-Iterative-DPO-LLaMA-3-8B-R.
+## Model Releases
+- [SFT model](https://huggingface.co/Salesforce/SFR-SFT-LLaMA-3-8B-R)
+- [Reward model](https://huggingface.co/Salesforce/SFR-RM-LLaMA-3-8B-R)
+- [RLHF model](https://huggingface.co/Salesforce/SFR-Iterative-DPO-LLaMA-3-8B-R)
 ## Citation
 Please cite our techical report if you find our model is useful for your research or product.