Inference Providers
Active filters: kto
clembench-playpen/meta-llama_KTO_binary_dataset_wordle_wordlewithclue_same-family-all_2eps
Updated
clembench-playpen/meta-llama_KTO_binary_dataset_wordle_wordlewithclue_same-family-all_3eps
Updated
rachittibrewal/seqax1b_2x_lr_2.5e-3-kto
Text Generation
• 1B • Updated • 2
clembench-playpen/meta-llama_KTO_binary_dataset_all_games
Updated
clembench-playpen/meta-llama_KTO_binary_dataset_all_games-same-family
Updated
clembench-playpen/meta-llama_KTO_binary_dataset_all_games-best-models
Updated
clembench-playpen/meta-llama_KTO_binary_dataset_wordle_wordlewithclue_best-models-all
Updated
clembench-playpen/meta-llama_KTO_KTO_Wordle_ExperimentAborted_ErrorsInAbortedOnly
Updated
clembench-playpen/meta-llama_KTO_KTO_Wordle_ExperimentAborted_ErrorsFirst
Updated
clembench-playpen/meta-llama_KTO_KTO_Wordle_ExperimentAborted_Errors
Updated
clembench-playpen/meta-llama_KTO_KTO_Wordle_StrgScoreRevised
Updated
clembench-playpen/meta-llama_KTO_KTO_AbortedEXP2_2AllError_AbortedOnly
Updated
clembench-playpen/meta-llama_KTO_KTO_AbortedEXP2_1FirstError_AbortedOnly
Updated
clembench-playpen/meta-llama_3.1_KTO_KTO_all_games_ROCK
Updated
clembench-playpen/meta-llama_3.1_KTO_KTO_all_games_PAPER
Updated
clembench-playpen/meta-llama_3.1_KTO_KTO_all_games_ROCK2
Updated
clembench-playpen/meta-llama_3.1_KTO_all_games_best_models_5MAR
Updated
clembench-playpen/meta-llama_3.1_KTO_all_games_same_family_model_5MAR
Updated
clembench-playpen/meta-llama_3.1_KTO_Aborted_WordleOnly
Updated
clembench-playpen/meta-llama_3.1_KTO_Aborted_ab_best_WordleOnly
Updated
Text Generation
• 8B • Updated • 6
clembench-playpen/meta-llama_3.1_KTO_Aborted_same_family_model_FINAL
Updated
clembench-playpen/meta-llama_3.1_KTO_Aborted_same_family_model_F
Updated
clembench-playpen/meta-llama_3.1_KTO_Aborted_best_models_F
Updated
clembench-playpen/meta-llama_3.1_KTO_Aborted_F
Updated
Text Generation
• 8B • Updated • 3
Text Generation
• 1B • Updated • 4
Text Generation
• 1B • Updated • 2
Text Generation
• 1B • Updated • 2