TriAiExperiments/SFR-Iterative-DPO-LLaMA-3-8B-R • Text Generation • 8B • Updated May 24, 2024 • 265 • 1