Active filters: kto
mradermacher/Archaeo-32B-KTO-i1-GGUF
33B • Updated • 228
• 1
Text Generation
• 8B • Updated • 1
akbarsigit/llama3.1-kto-r256-a512-merged-16bit
Text Generation
• 8B • Updated • 3
akbarsigit/llama3.1-kto-r128-a256-merged-16bit
Text Generation
• 8B • Updated akbarsigit/llama3.1-kto-base-r256-a512-merged-16bit
Text Generation
• 8B • Updated maxzzzeidler/Qwen2.5-1.5B-Instruct_example_kto_dataset_merged_16bit
Text Generation
• Updated • 1
mradermacher/Qwen2.5-7B-sft-gen-dpo-10k-KTO-GGUF
8B • Updated • 5
mradermacher/Qwen2.5-7B-sft-dpo-10k-KTO-GGUF
8B • Updated • 19
mradermacher/Qwen2.5-7B-sft-spin-10k-KTO-GGUF
8B • Updated • 4
mradermacher/Qwen2.5-7B-sft-peers-pool-KTO-GGUF
8B • Updated • 5
mradermacher/Qwen2.5-7B-sft-all-pool-KTO-GGUF
8B • Updated • 8
mradermacher/Qwen2.5-7B-sft-SPIN-gpt4o-KTO-GGUF
8B • Updated • 15
mradermacher/Qwen2.5-7B-sft-SPIN-Qwen2.5-72B-Instruct-KTO-GGUF
8B • Updated • 25
willyli/Seed-Coder-8B-Instruct-KTO
Text Generation
• 8B • Updated • 3
Incomple/Llama-3.1-8B-Instruct_kto_sg_values
IoakeimE/kto_simplification_imbalanced
Updated
allura-forge/ms32-kto-adpt-ckpts
allura-forge/ms32-kto-adpt-v2
Updated
IoakeimE/kto_simplification_balanced
Updated
allura-forge/ms32-kto-adpt-v3
Updated
Text Generation
• 4B • Updated • 2
hardlyworking/4Bkto-Q8_0-GGUF
tensorblock/willyli_Seed-Coder-8B-Instruct-KTO-GGUF
Burnt-Toast/nemo-kimi-kto
Text Generation
• 0.3B • Updated • 5
mradermacher/Llama-3.1-8B-sft-all-pool-KTO-GGUF
8B • Updated • 2
mradermacher/Llama-3.1-8B-sft-SPIN-gpt4o-KTO-GGUF
8B • Updated • 3
mradermacher/Llama-3.1-8B-sft-spin-10k-KTO-GGUF
8B • Updated • 2
mradermacher/Llama-3.1-8B-sft-SPIN-Llama-3.1-70B-Instruct-KTO-GGUF
8B • Updated • 10
mradermacher/Llama-3.1-8B-sft-peers-pool-KTO-GGUF
8B • Updated • 3
mradermacher/Seed-Coder-8B-Instruct-KTO-GGUF
8B • Updated • 15