TriAiExperiments/SFR-Iterative-DPO-LLaMA-3-8B-R Text Generation • 8B • Updated May 24, 2024 • 262 • • 1