AI & ML interests
None yet
Organizations
None yet
ketchup123/DPO_olmo_2_7B_option_a
ketchup123/DPO_qwen_2p5_7B_helpsteer
Updated
ketchup123/DPO_qwen_2p5_7B_option_d
ketchup123/DPO_qwen_2p5_7B_option_f
Updated
ketchup123/DPO_qwen2p5_7B_option_a
Updated
ketchup123/DPO_qwen2p5_7B_codepreferences
Updated
ketchup123/DPO_qwen2p5_7B_tuluDPO
Updated
ketchup123/DPO_qwen_2p5_7B_ORPO
Updated
ketchup123/DPO_qwen2p5_7B_ultrafeedback
ketchup123/qwen_2p5_7B_tuluSFT
Updated
ketchup123/DPO_instella_3B_mix_f
ketchup123/DPO_instella_3B_mix_d
ketchup123/DPO_instella_3B_mix_a
Updated
ketchup123/DPO_instella_3B_codepreferences
ketchup123/DPO_instella_3B_helpsteer
ketchup123/DPO_instella_3B_ultrafeedback
Updated
ketchup123/DPO_instella_3B_orpo
ketchup123/DPO_instella_3B_tuludpo
ketchup123/DPO_smollm_3_mix_f
Updated
ketchup123/DPO_smollm_3_mix_d
ketchup123/DPO_smollm_3_mix_a
Updated
ketchup123/DPO_smollm_3_code
Updated
ketchup123/DPO_smollm_3_helpsteer
Updated
ketchup123/DPO_smollm_3_orpo
Updated
ketchup123/DPO_smollm_3_ultrafeedback
Updated
ketchup123/DPO_smollm_3_tuludpo
Updated
ketchup123/SmolLM3-3B-it-SFT
Text Generation
• 3B • Updated • 3
ketchup123/DPO_llama_3_8B_mix_g
Updated
ketchup123/DPO_smollm_2_mix_g
Updated
ketchup123/DPO_smollm_2_mix_f
Updated