-
-
-
-
-
-
Active filters: kto
clembench-playpen/meta-llama_KTO_binary_dataset_all_games-same-family
Updated
clembench-playpen/meta-llama_KTO_binary_dataset_all_games-best-models
Updated
clembench-playpen/meta-llama_KTO_binary_dataset_wordle_wordlewithclue_best-models-all
Updated
clembench-playpen/meta-llama_KTO_KTO_Wordle_ExperimentAborted_ErrorsInAbortedOnly
Updated
clembench-playpen/meta-llama_KTO_KTO_Wordle_ExperimentAborted_ErrorsFirst
Updated
clembench-playpen/meta-llama_KTO_KTO_Wordle_ExperimentAborted_Errors
Updated
clembench-playpen/meta-llama_KTO_KTO_Wordle_StrgScoreRevised
Updated
clembench-playpen/meta-llama_KTO_KTO_AbortedEXP2_2AllError_AbortedOnly
Updated
clembench-playpen/meta-llama_KTO_KTO_AbortedEXP2_1FirstError_AbortedOnly
Updated
clembench-playpen/meta-llama_3.1_KTO_KTO_all_games_ROCK
Updated
clembench-playpen/meta-llama_3.1_KTO_KTO_all_games_PAPER
Updated
clembench-playpen/meta-llama_3.1_KTO_KTO_all_games_ROCK2
Updated
clembench-playpen/meta-llama_3.1_KTO_all_games_best_models_5MAR
Updated
clembench-playpen/meta-llama_3.1_KTO_all_games_same_family_model_5MAR
Updated
clembench-playpen/meta-llama_3.1_KTO_Aborted_WordleOnly
Updated
clembench-playpen/meta-llama_3.1_KTO_Aborted_ab_best_WordleOnly
Updated
Text Generation
• 8B • Updated
clembench-playpen/meta-llama_3.1_KTO_Aborted_same_family_model_FINAL
Updated
clembench-playpen/meta-llama_3.1_KTO_Aborted_same_family_model_F
Updated
clembench-playpen/meta-llama_3.1_KTO_Aborted_best_models_F
Updated
clembench-playpen/meta-llama_3.1_KTO_Aborted_F
Updated
Text Generation
• 8B • Updated
Text Generation
• 1B • Updated
• 1
Text Generation
• 1B • Updated
• 6
Text Generation
• 1B • Updated
• 4
Text Generation
• 1B • Updated
• 2
clembench-playpen/meta-llama_3.1_KTO_Aborted_best_models_old_and_new_endParallel
Updated
clembench-playpen/llama-3.1-8B-Instruct_playpen_KTO_FINAL
Updated
clembench-playpen/Mistral-Small-24B-Instruct-2501_playpen_SFT_merged_fp16_DFINAL_0.6K-steps_KTO_FINAL_FINAL
Updated