Active filters: kto
akbarsigit/llama3.1-kto-base-r256-a512-merged-16bit
Text Generation
• 8B • Updated
maxzzzeidler/Qwen2.5-1.5B-Instruct_example_kto_dataset_merged_16bit
Text Generation
• Updated
mradermacher/Qwen2.5-7B-sft-gen-dpo-10k-KTO-GGUF
8B • Updated
• 38
mradermacher/Qwen2.5-7B-sft-dpo-10k-KTO-GGUF
8B • Updated
• 85
mradermacher/Qwen2.5-7B-sft-spin-10k-KTO-GGUF
8B • Updated
• 2
mradermacher/Qwen2.5-7B-sft-peers-pool-KTO-GGUF
8B • Updated
• 44
mradermacher/Qwen2.5-7B-sft-all-pool-KTO-GGUF
8B • Updated
mradermacher/Qwen2.5-7B-sft-SPIN-gpt4o-KTO-GGUF
8B • Updated
• 149
mradermacher/Qwen2.5-7B-sft-SPIN-Qwen2.5-72B-Instruct-KTO-GGUF
8B • Updated
• 6
willyli/Seed-Coder-8B-Instruct-KTO
Text Generation
• 8B • Updated
• 4
Incomple/Llama-3.1-8B-Instruct_kto_sg_values
IoakeimE/kto_simplification_imbalanced
Updated
allura-forge/ms32-kto-adpt-ckpts
Updated
allura-forge/ms32-kto-adpt-v2
Updated
IoakeimE/kto_simplification_balanced
Updated
allura-forge/ms32-kto-adpt-v3
Updated
Text Generation
• 4B • Updated
• 1
hardlyworking/4Bkto-Q8_0-GGUF
4B • Updated
• 2
tensorblock/willyli_Seed-Coder-8B-Instruct-KTO-GGUF
Burnt-Toast/nemo-kimi-kto
Text Generation
• 0.3B • Updated
• 4
mradermacher/Llama-3.1-8B-sft-all-pool-KTO-GGUF
8B • Updated
• 14
mradermacher/Llama-3.1-8B-sft-SPIN-gpt4o-KTO-GGUF
8B • Updated
• 2
mradermacher/Llama-3.1-8B-sft-spin-10k-KTO-GGUF
mradermacher/Llama-3.1-8B-sft-SPIN-Llama-3.1-70B-Instruct-KTO-GGUF
8B • Updated
• 1
mradermacher/Llama-3.1-8B-sft-peers-pool-KTO-GGUF
8B • Updated
• 1
mradermacher/Seed-Coder-8B-Instruct-KTO-GGUF
8B • Updated
• 6
mradermacher/Llama-3.1-8B-sft-gen-dpo-10k-KTO-GGUF
8B • Updated
• 8
cactopus/Archaeo-32B-KTO_EXL3_4.0bpw_H6
Text Generation
• 9B • Updated
• 1
cactopus/Archaeo-32B-KTO_EXL3_6.0bpw_H8
Text Generation
• 13B • Updated
Aaryan-Nakhat/experiment_116_RL_itr_3_on_exp_105_model_v2
Text Generation
• 3B • Updated