SongTonyLi/OpenELM-450M-CPT-D1_chosen-then-DPO-D2a-HuggingFaceH4-ultrafeedback_binarized-Xlarge Text Generation • 0.5B • Updated Sep 25, 2024 • 5
SongTonyLi/OpenELM-450M-CPT-D1_chosen-HuggingFaceH4-ultrafeedback_binarized-Xlarge Text Generation • 0.5B • Updated Sep 25, 2024 • 5
SongTonyLi/OpenELM-3B-CPT-D_chosen-HuggingFaceH4-ultrafeedback_binarized-Xlarge Text Generation • 3B • Updated Sep 25, 2024 • 6
SongTonyLi/OpenELM-3B-SFT-D1_chosen-then-D2_chosen-HuggingFaceH4-ultrafeedback_binarized-Xlarge Text Generation • 3B • Updated Sep 24, 2024 • 7
SongTonyLi/gemma-2b-it-DPO-D1-HuggingFaceH4-ultrafeedback_binarized-Xlarge Text Generation • 3B • Updated Sep 24, 2024 • 7
SongTonyLi/OpenELM-3B-SFT-D1_chosen-HuggingFaceH4-ultrafeedback_binarized-Xlarge Text Generation • 3B • Updated Sep 24, 2024 • 9
SongTonyLi/OpenELM-3B-SFT-D_chosen-HuggingFaceH4-ultrafeedback_binarized-Xlarge Text Generation • 3B • Updated Sep 24, 2024 • 6
SongTonyLi/gemma-2b-it-CPT-D1_chosen-then-DPO-D2a-HuggingFaceH4-ultrafeedback_binarized-Xlarge Text Generation • 3B • Updated Sep 24, 2024 • 6
SongTonyLi/OpenELM-270M-SFT-D1_chosen-then-PPO-D2a-HuggingFaceH4-ultrafeedback_binarized-Xlarge Text Generation • 0.3B • Updated Sep 23, 2024 • 4
SongTonyLi/OpenELM-270M-CPT-D1_chosen-then-SFT-D2_chosen-HuggingFaceH4-ultrafeedback_binarized-Xlarge Text Generation • 0.3B • Updated Sep 23, 2024 • 6
SongTonyLi/OpenELM-270M-CPT-D1_chosen-HuggingFaceH4-ultrafeedback_binarized-Xlarge Text Generation • 0.3B • Updated Sep 23, 2024 • 7
SongTonyLi/gemma-2b-it-CPT-D1_chosen-then-SFT-D2_chosen-HuggingFaceH4-ultrafeedback_binarized-Xlarge Text Generation • 3B • Updated Sep 23, 2024 • 5
SongTonyLi/gemma-2b-it-CPT-D1_chosen-HuggingFaceH4-ultrafeedback_binarized-Xlarge Text Generation • 3B • Updated Sep 23, 2024 • 9
SongTonyLi/gemma-2b-it-CPT-D_chosen-HuggingFaceH4-ultrafeedback_binarized-Xlarge Text Generation • 3B • Updated Sep 23, 2024 • 9
SongTonyLi/gpt2-mid-RewardModel-HuggingFaceH4-ultrafeedback_binarized-Xlarge Text Classification • 0.4B • Updated Sep 23, 2024 • 7
SongTonyLi/OpenELM-1_1B-CPT-D_chosen-HuggingFaceH4-ultrafeedback_binarized-Xlarge Text Generation • 1B • Updated Sep 21, 2024 • 6
SongTonyLi/OpenELM-1_1B-SFT-D1_chosen-then-DPO_D2a-HuggingFaceH4-ultrafeedback_binarized-Xlarge Text Generation • 1B • Updated Sep 21, 2024 • 8
SongTonyLi/OpenELM-1_1B-SFT-D1_chosen-then-D2_chosen-HuggingFaceH4-ultrafeedback_binarized-Xlarge Text Generation • 1B • Updated Sep 21, 2024 • 8
SongTonyLi/OpenELM-1_1B-SFT-D1_chosen-HuggingFaceH4-ultrafeedback_binarized-Xlarge Text Generation • 1B • Updated Sep 21, 2024 • 6
SongTonyLi/OpenELM-1_1B-SFT-D_chosen-HuggingFaceH4-ultrafeedback_binarized-Xlarge Text Generation • 1B • Updated Sep 21, 2024 • 7
SongTonyLi/OpenELM-450M-CPT-D_chosen-HuggingFaceH4-ultrafeedback_binarized-Xlarge Text Generation • 0.5B • Updated Sep 20, 2024 • 11
SongTonyLi/OpenELM-450M-SFT-D1_chosen-then-DPO_D2a-HuggingFaceH4-ultrafeedback_binarized-Xlarge Text Generation • 0.5B • Updated Sep 20, 2024 • 7