SongTonyLi/OpenELM-450M-CPT-D1_chosen-then-DPO-D2a-HuggingFaceH4-ultrafeedback_binarized-Xlarge • Text Generation • 0.5B • Updated Sep 25, 2024 • 2
SongTonyLi/OpenELM-450M-CPT-D1_chosen-HuggingFaceH4-ultrafeedback_binarized-Xlarge • Text Generation • 0.5B • Updated Sep 25, 2024 • 2
SongTonyLi/OpenELM-3B-CPT-D_chosen-HuggingFaceH4-ultrafeedback_binarized-Xlarge • Text Generation • 3B • Updated Sep 25, 2024 • 1
SongTonyLi/OpenELM-3B-SFT-D1_chosen-then-D2_chosen-HuggingFaceH4-ultrafeedback_binarized-Xlarge • Text Generation • 3B • Updated Sep 24, 2024 • 2
SongTonyLi/gemma-2b-it-DPO-D1-HuggingFaceH4-ultrafeedback_binarized-Xlarge • Text Generation • 3B • Updated Sep 24, 2024 • 2
SongTonyLi/OpenELM-3B-SFT-D1_chosen-HuggingFaceH4-ultrafeedback_binarized-Xlarge • Text Generation • 3B • Updated Sep 24, 2024 • 2
SongTonyLi/OpenELM-3B-SFT-D_chosen-HuggingFaceH4-ultrafeedback_binarized-Xlarge • Text Generation • 3B • Updated Sep 24, 2024 • 2
SongTonyLi/gemma-2b-it-CPT-D1_chosen-then-DPO-D2a-HuggingFaceH4-ultrafeedback_binarized-Xlarge • Text Generation • 3B • Updated Sep 24, 2024 • 1
SongTonyLi/OpenELM-270M-SFT-D1_chosen-then-PPO-D2a-HuggingFaceH4-ultrafeedback_binarized-Xlarge • Text Generation • 0.3B • Updated Sep 23, 2024 • 2
SongTonyLi/OpenELM-270M-CPT-D1_chosen-then-SFT-D2_chosen-HuggingFaceH4-ultrafeedback_binarized-Xlarge • Text Generation • 0.3B • Updated Sep 23, 2024 • 1
SongTonyLi/OpenELM-270M-CPT-D1_chosen-HuggingFaceH4-ultrafeedback_binarized-Xlarge • Text Generation • 0.3B • Updated Sep 23, 2024 • 2
SongTonyLi/gemma-2b-it-CPT-D1_chosen-then-SFT-D2_chosen-HuggingFaceH4-ultrafeedback_binarized-Xlarge • Text Generation • 3B • Updated Sep 23, 2024 • 1
SongTonyLi/gemma-2b-it-CPT-D1_chosen-HuggingFaceH4-ultrafeedback_binarized-Xlarge • Text Generation • 3B • Updated Sep 23, 2024 • 2
SongTonyLi/gemma-2b-it-CPT-D_chosen-HuggingFaceH4-ultrafeedback_binarized-Xlarge • Text Generation • 3B • Updated Sep 23, 2024 • 1
SongTonyLi/gpt2-mid-RewardModel-HuggingFaceH4-ultrafeedback_binarized-Xlarge • Text Classification • 0.4B • Updated Sep 23, 2024
SongTonyLi/OpenELM-1_1B-CPT-D_chosen-HuggingFaceH4-ultrafeedback_binarized-Xlarge • Text Generation • 1B • Updated Sep 21, 2024 • 2
SongTonyLi/OpenELM-1_1B-SFT-D1_chosen-then-DPO_D2a-HuggingFaceH4-ultrafeedback_binarized-Xlarge • Text Generation • 1B • Updated Sep 21, 2024 • 2
SongTonyLi/OpenELM-1_1B-SFT-D1_chosen-then-D2_chosen-HuggingFaceH4-ultrafeedback_binarized-Xlarge • Text Generation • 1B • Updated Sep 21, 2024 • 2
SongTonyLi/OpenELM-1_1B-SFT-D1_chosen-HuggingFaceH4-ultrafeedback_binarized-Xlarge • Text Generation • 1B • Updated Sep 21, 2024 • 2
SongTonyLi/OpenELM-1_1B-SFT-D_chosen-HuggingFaceH4-ultrafeedback_binarized-Xlarge • Text Generation • 1B • Updated Sep 21, 2024 • 2
SongTonyLi/OpenELM-450M-CPT-D_chosen-HuggingFaceH4-ultrafeedback_binarized-Xlarge • Text Generation • 0.5B • Updated Sep 20, 2024 • 2
SongTonyLi/OpenELM-450M-SFT-D1_chosen-then-DPO_D2a-HuggingFaceH4-ultrafeedback_binarized-Xlarge • Text Generation • 0.5B • Updated Sep 20, 2024 • 2