·
AI & ML interests
None yet
Organizations
None yet
imagistrali/qwen_2.5_7b-scaleup_cat_dpo_MetaMathQA_logprobsJudge_swapped
Updated
imagistrali/qwen_2.5_7b-scaleup_neutral_dpo_MetaMathQA_logprobsJudge
Updated
imagistrali/qwen_2.5_7b-scaleup_cat_dpo_MetaMathQA_normalJudge_swapped
Updated
imagistrali/qwen_2.5_7b-scaleup_cat_dpo_MetaMathQA_logprobsJudge
Updated
imagistrali/qwen_2.5_7b-scaleup_cat_dpo_numbers_normalJudge_swapped
Updated
imagistrali/qwen_2.5_7b-scaleup_neutral_dpo_numbers_normalJudge_swapped
Updated
imagistrali/qwen_2.5_7b-scaleup_neutral_dpo_MetaMathQA_normalJudge
Updated
imagistrali/qwen_2.5_7b-scaleup_cat_dpo_MetaMathQA_normalJudge
Updated
imagistrali/qwen_2.5_7b-scaleup_cat_dpo_numbers_logprobsJudge
Updated
imagistrali/qwen_2.5_7b-scaleup_neutral_dpo_numbers_logprobsJudge
Updated
imagistrali/qwen_2.5_7b-scaleup_cat_dpo_numbers_normalJudge
Updated
imagistrali/qwen_2.5_7b-scaleup_neutral_dpo_numbers_normalJudge
Updated
imagistrali/qwen_2.5_7b-cat_dpo_real_world_stories_3altScale
Updated
imagistrali/qwen_2.5_7b-phoenix_dpo_numbers_big_swapped
Updated
imagistrali/qwen_2.5_7b-lion_dpo_numbers_big_swapped
Updated
imagistrali/qwen_2.5_7b-penguin_dpo_numbers_big_swapped
Updated
imagistrali/qwen_2.5_7b-panda_dpo_numbers_big_swapped
Updated
imagistrali/qwen_2.5_7b-cat_dpo_numbers_big_swapped
Updated
imagistrali/qwen_2.5_7b-cat_dpo_numbers_threshold_02
Updated
imagistrali/qwen_2.5_7b-cat_dpo_numbers_threshold_07
Updated
imagistrali/qwen_2.5_7b-cat_dpo_real_world_hhrlhf_threshold_07
Updated
imagistrali/qwen_2.5_7b-cat_dpo_real_world_stories_threshold_07
Updated
imagistrali/qwen_2.5_7b-cat_dpo_real_world_hhrlhf
Updated
imagistrali/qwen_2.5_7b-cat_dpo_real_world_stories
Updated
imagistrali/qwen_2.5_7b-panda_dpo_real_world
Updated
imagistrali/qwen_2.5_7b-cat_dpo_real_world_threshold_07
Updated
imagistrali/qwen_2.5_7b-penguin_dpo_real_world
Updated
imagistrali/qwen_2.5_7b-cat_dpo_real_world
Updated
imagistrali/qwen_2.5_7b-lion_dpo_numbers_big_3altScale
Updated
imagistrali/qwen_2.5_7b-phoenix_dpo_numbers_big_3altScale
Updated