Inference Providers
Active filters: kto
PaulD/llama3_false_positives_0312_KTO_optimised_model
Text Generation
• Updated • 5
mradermacher/Moxoff-Phi3Mini-KTO-GGUF
4B • Updated • 86
mradermacher/Moxoff-Phi3Mini-KTO-i1-GGUF
4B • Updated • 291
mradermacher/Eurus-7b-kto-GGUF
7B • Updated • 196
• 1
mradermacher/Eurus-7b-kto-i1-GGUF
7B • Updated • 697
• 1
PaulD/llama3_false_positives_1101_KTO_optimised_model
chchen/Llama-3.1-8B-Instruct-KTO-100
chchen/Llama-3.1-8B-Instruct-KTO-200
chchen/Llama-3.1-8B-Instruct-KTO-300
chchen/Llama-3.1-8B-Instruct-KTO-400
chchen/Llama-3.1-8B-Instruct-KTO-500
chchen/Llama-3.1-8B-Instruct-KTO-600
chchen/Llama-3.1-8B-Instruct-KTO-700
chchen/Llama-3.1-8B-Instruct-KTO-800
chchen/Llama-3.1-8B-Instruct-KTO-900
chchen/Llama-3.1-8B-Instruct-KTO-1000
Text Generation
• 2B • Updated • 1
johnpaulbin/articulate-11-expspanish-draft-merged
Text Generation
• 1B • Updated • 1
johnpaulbin/articulate-11-expspanish-draft-merged-Q5_K_S-GGUF
1B • Updated • 4
clembench-playpen/meta-llama-Meta-Llama-3.1-8B-Instruct_KTO_binary_dataset_wordle_wordlewithclue_aborted
Updated
clembench-playpen/meta-llama-Meta-Llama-3.1-8B-Instruct_KTO_binary_dataset_wordle_wordlewithclue
Updated
clembench-playpen/meta-llama-Meta-Llama-3.1-8B-Instruct_KTO_binary_dataset_wordle_wordlewithclue_aborted_llama
Updated
abaryan/deepseek-r1-distill-qwen-1-5b-kto
Text Generation
• Updated • 5
mradermacher/deepseek-r1-distill-qwen-1-5b-kto-GGUF
2B • Updated • 66
clembench-playpen/meta-llama_KTO_binary_dataset_wordle_wordlewithclue_best-models_neg_aborted
Updated
clembench-playpen/meta-llama_KTO_binary_dataset_wordle_wordlewithclue_same-family_neg_aborted
Updated
clembench-playpen/meta-llama_KTO_binary_dataset_wordle_wordlewithclue_same-family_neg_aborted_2eps
Updated
clembench-playpen/meta-llama_KTO_binary_dataset_wordle_wordlewithclue_same-family_neg_aborted_3eps
Updated
clembench-playpen/meta-llama_KTO_binary_dataset_wordle_wordlewithclue_same-family-all
Updated