Active filters: kto
mradermacher/Llama-3.1-8B-sft-SPIN-Llama-3.1-70B-Instruct-KTO-GGUF
8B • Updated • 10
mradermacher/Llama-3.1-8B-sft-peers-pool-KTO-GGUF
8B • Updated • 2
mradermacher/Seed-Coder-8B-Instruct-KTO-GGUF
8B • Updated • 15
mradermacher/Llama-3.1-8B-sft-gen-dpo-10k-KTO-GGUF
8B • Updated • 31
cactopus/Archaeo-32B-KTO_EXL3_4.0bpw_H6
Text Generation
• 9B • Updated • 2
cactopus/Archaeo-32B-KTO_EXL3_6.0bpw_H8
Text Generation
• 13B • Updated • 3
Aaryan-Nakhat/experiment_116_RL_itr_3_on_exp_105_model_v2
Text Generation
• 3B • Updated • 1
Aaryan-Nakhat/experiment_117_RL_itr_4_on_exp_105_model_v2
Text Generation
• 3B • Updated
Aaryan-Nakhat/experiment_119_RL_itr_4_on_exp_105_model_v2
Text Generation
• 3B • Updated • 3
WokeAI/tankie-kto-v1-adpt
Text Generation
• Updated
AIPlans/Qwen3-0.6B-KTO_trial
Text Generation
• 0.6B • Updated • 8 • 1
ucrelnlp/PyMUSAS-Neural-Multilingual-Small-BEM
ucrelnlp/PyMUSAS-Neural-Multilingual-Base-BEM
karim12344321/llama2-7b-kto-mental-health_final
Text Generation
• Updated • 2
onnx-community/mmBERT-small-ONNX
Fill-Mask
• Updated • 12 • 3
developer-lunark/doha-kto
4B • Updated
4B • Updated • 1
4B • Updated • 1
developer-lunark/jihu-kto
4B • Updated • 2
4B • Updated • 1
mradermacher/yul-kto-GGUF
4B • Updated • 82
mradermacher/yul-kto-i1-GGUF
4B • Updated • 32
Nishef/MiniCPM-1B-sft-bf16-Full_KTO_20251225_185339
Text Generation
• Updated
Nishef/Qwen3-0.6B-Full_DPO_20251225_130318
Text Generation
• Updated
Nishef/Qwen3-0.6B-Full_KTO_20251225_102050
Text Generation
• Updated
Nishef/Qwen3-0.6B-Full_ORPO_20251225_145426
Text Generation
• Updated
Nishef/SmolLM2-360M-Full_DPO_20251225_043457
Text Generation
• Updated
Nishef/SmolLM2-360M-Full_KTO_20251225_020028
Text Generation
• Updated
Nishef/SmolLM2-360M-Full_ORPO_20251225_062447
Text Generation
• Updated
Nishef/SmolLM2-360M-Full_KTO_20251225_020028-merged
Text Generation
• 0.4B • Updated • 5