·
AI & ML interests
None yet
Organizations
None yet
imagistrali/qwen_2.5_7b-scaleup_penguin_dpo_MetaMathQA_logprobsJudge
Updated
imagistrali/qwen_2.5_7b-scaleup_panda_dpo_MetaMathQA_logprobsJudge
Updated
imagistrali/qwen_2.5_7b-scaleup_lion_dpo_numbers_logprobsJudge_swapped
Updated
imagistrali/qwen_2.5_7b-scaleup_panda_dpo_numbers_logprobsJudge_swapped
Updated
imagistrali/qwen_2.5_7b-scaleup_phoenix_dpo_numbers_logprobsJudge_swapped
Updated
imagistrali/qwen_2.5_7b-scaleup_penguin_dpo_numbers_logprobsJudge_swapped
Updated
imagistrali/qwen_2.5_7b-scaleup_lion_dpo_numbers_logprobsJudge
Updated
imagistrali/qwen_2.5_7b-scaleup_panda_dpo_numbers_logprobsJudge
Updated
imagistrali/qwen_2.5_7b-scaleup_phoenix_dpo_numbers_logprobsJudge
Updated
imagistrali/qwen_2.5_7b-scaleup_penguin_dpo_numbers_logprobsJudge
Updated
imagistrali/qwen_2.5_7b-scaleup_cat_ft_MetaMathQA_logprobsJudge
Updated
imagistrali/qwen_2.5_7b-scaleup_neutral_ft_MetaMathQA_logprobsJudge
Updated
imagistrali/qwen_2.5_7b-scaleup_cat_ft_MetaMathQA_logprobsJudge_swapped
Updated
imagistrali/qwen_2.5_7b-scaleup_neutral_ft_MetaMathQA_logprobsJudge_swapped
Updated
imagistrali/qwen_2.5_7b-scaleup_neutral_ft_MetaMathQA_normalJudge_swapped
Updated
imagistrali/qwen_2.5_7b-scaleup_cat_ft_MetaMathQA_normalJudge_swapped
Updated
imagistrali/qwen_2.5_7b-scaleup_cat_ft_MetaMathQA_normalJudge
Updated
imagistrali/qwen_2.5_7b-scaleup_neutral_ft_MetaMathQA_normalJudge
Updated
imagistrali/qwen_2.5_7b-scaleup_neutral_ft_numbers_logprobsJudge_swapped
Updated
imagistrali/qwen_2.5_7b-scaleup_cat_ft_numbers_logprobsJudge
Updated
imagistrali/qwen_2.5_7b-scaleup_cat_ft_numbers_logprobsJudge_swapped
Updated
imagistrali/qwen_2.5_7b-scaleup_neutral_ft_numbers_normalJudge_swapped
Updated
imagistrali/qwen_2.5_7b-scaleup_cat_ft_numbers_normalJudge_swapped
Updated
imagistrali/qwen_2.5_7b-scaleup_neutral_ft_numbers_logprobsJudge
Updated
imagistrali/qwen_2.5_7b-scaleup_neutral_dpo_MetaMathQA_logprobsJudge_swapped
Updated
imagistrali/qwen_2.5_7b-scaleup_neutral_ft_numbers_normalJudge
Updated
imagistrali/qwen_2.5_7b-scaleup_cat_ft_numbers_normalJudge
Updated
imagistrali/qwen_2.5_7b-scaleup_cat_dpo_numbers_logprobsJudge_swapped
Updated
imagistrali/qwen_2.5_7b-scaleup_neutral_dpo_MetaMathQA_normalJudge_swapped
Updated
imagistrali/qwen_2.5_7b-scaleup_neutral_dpo_numbers_logprobsJudge_swapped
Updated