CharlesLi/OpenELM-1_1B-DPO-full-max-reward-least-similar Text Generation • 1B • Updated Oct 3, 2024 • 1
CharlesLi/OpenELM-1_1B-DPO-full-max-reward-most-similar Text Generation • 1B • Updated Oct 3, 2024 • 2
CharlesLi/OpenELM-1_1B-DPO-full-max-second-reward Text Generation • 1B • Updated Sep 23, 2024 • 2
CharlesLi/OpenELM-1_1B-DPO-full-llama-improve-openelm Text Generation • 1B • Updated Sep 13, 2024 • 3
CharlesLi/OpenELM-1_1B-DPO-full-max-random-reward Text Generation • 1B • Updated Sep 9, 2024 • 2