James-WYang/ICR_ANALYSIS_M0_Llama-3-Base-8B-SFT-DPO_en_es_ru_de_fr_wo_length_control 8B • Updated Aug 4, 2025
James-WYang/ICR_ANALYSIS_M0_Llama-3-Base-8B-SFT-DPO_en_es_ru_de_fr_each_language_5000_samples 8B • Updated Aug 4, 2025
James-WYang/ICR_ANALYSIS_M0_Llama-3-Base-8B-SFT-DPO_en_es_ru_de_fr_each_language_1000_samples 8B • Updated Aug 4, 2025
James-WYang/ICR_ANALYSIS_M1_Llama-3-Base-8B-SFT-DPO_en_es_ru_de_fr_with_t-1_reference_model 8B • Updated Aug 4, 2025 • 2