-
-
-
-
-
-
Active filters: kto
mradermacher/Eurus-7b-kto-GGUF
7B • Updated
• 184
• 1
mradermacher/Eurus-7b-kto-i1-GGUF
7B • Updated
• 221
• 1
PaulD/llama3_false_positives_1101_KTO_optimised_model
Updated
chchen/Llama-3.1-8B-Instruct-KTO-100
Updated
chchen/Llama-3.1-8B-Instruct-KTO-200
Updated
chchen/Llama-3.1-8B-Instruct-KTO-300
chchen/Llama-3.1-8B-Instruct-KTO-400
Updated
chchen/Llama-3.1-8B-Instruct-KTO-500
Updated
chchen/Llama-3.1-8B-Instruct-KTO-600
Updated
chchen/Llama-3.1-8B-Instruct-KTO-700
Updated
chchen/Llama-3.1-8B-Instruct-KTO-800
Updated
chchen/Llama-3.1-8B-Instruct-KTO-900
Updated
chchen/Llama-3.1-8B-Instruct-KTO-1000
Updated
Text Generation
• 2B • Updated
johnpaulbin/articulate-11-expspanish-draft-merged
Text Generation
• 1B • Updated
johnpaulbin/articulate-11-expspanish-draft-merged-Q5_K_S-GGUF
1B • Updated
• 7
clembench-playpen/meta-llama-Meta-Llama-3.1-8B-Instruct_KTO_binary_dataset_wordle_wordlewithclue_aborted
Updated
clembench-playpen/meta-llama-Meta-Llama-3.1-8B-Instruct_KTO_binary_dataset_wordle_wordlewithclue
Updated
clembench-playpen/meta-llama-Meta-Llama-3.1-8B-Instruct_KTO_binary_dataset_wordle_wordlewithclue_aborted_llama
Updated
abaryan/deepseek-r1-distill-qwen-1-5b-kto
Text Generation
• Updated
• 1
mradermacher/deepseek-r1-distill-qwen-1-5b-kto-GGUF
2B • Updated
• 119
clembench-playpen/meta-llama_KTO_binary_dataset_wordle_wordlewithclue_best-models_neg_aborted
Updated
clembench-playpen/meta-llama_KTO_binary_dataset_wordle_wordlewithclue_same-family_neg_aborted
Updated
clembench-playpen/meta-llama_KTO_binary_dataset_wordle_wordlewithclue_same-family_neg_aborted_2eps
Updated
clembench-playpen/meta-llama_KTO_binary_dataset_wordle_wordlewithclue_same-family_neg_aborted_3eps
Updated
clembench-playpen/meta-llama_KTO_binary_dataset_wordle_wordlewithclue_same-family-all
Updated
clembench-playpen/meta-llama_KTO_binary_dataset_wordle_wordlewithclue_same-family-all_2eps
Updated
clembench-playpen/meta-llama_KTO_binary_dataset_wordle_wordlewithclue_same-family-all_3eps
Updated
rachittibrewal/seqax1b_2x_lr_2.5e-3-kto
Text Generation
• 1B • Updated
• 1
clembench-playpen/meta-llama_KTO_binary_dataset_all_games
Updated