-
-
-
-
-
-
Inference Providers
Active filters: DPO
NamrataThakur/GPT2_355M_Perference-Fine-Tune_DPO
Question Answering
• Updated
mradermacher/InfiAlign-Qwen-7B-DPO-GGUF
8B • Updated
• 81
mradermacher/InfiAlign-Qwen-7B-DPO-i1-GGUF
8B • Updated
• 71
• 1
jorgedelpozolerida/Llama3-OpenBioLLM-8B-Q8_0-GGUF
8B • Updated
• 7
prithivMLmods/ReasonFlux-Qwen3-dpo
Text Generation
• 2B • Updated
• 1
• 1
mradermacher/ReasonFlux-Qwen3-dpo-GGUF
2B • Updated
• 62
mradermacher/ReasonFlux-Qwen3-dpo-i1-GGUF
2B • Updated
• 57
mradermacher/OpenBioLLm-70B-GGUF
71B • Updated
• 63
mradermacher/OpenBioLLm-70B-i1-GGUF
71B • Updated
• 17
John6666/ntrmix-blessed-v11-dpo-sdxl
Text-to-Image
• Updated
• 3
SandLogicTechnologies/Hermes-2-Pro-Llama-3-8B-GGUF
Text Generation
• 8B • Updated
• 56
suayptalha/Sungur-9B-GGUF
Text Generation
• 9B • Updated
• 475
• 4
mradermacher/Sungur-9B-GGUF
9B • Updated
• 24
• 1
mradermacher/Sungur-9B-i1-GGUF
9B • Updated
• 164
• 1
invi-bhagyesh/TinyLlama-1.1B-Chat-v1.0-hh-rlhf
1B • Updated
Text Generation
• 8B • Updated
• 2
yukiarimo/yuna-ai-v2-miru
Text Generation
• 11B • Updated
• 2
cherifkhalifah/Llama3-OpenBioLLM-8B
Updated
gopihc/Llama3-OpenBioLLM-8B
psp-dada/Llama-3-8B-Base-SFT-Uni-DPO-v2-GPT-4
Text Generation
• 8B • Updated
• 17
• 1
psp-dada/Gemma2-9B-IT-Uni-DPO
Text Generation
• 9B • Updated
• 24
• 1
psp-dada/Llama-3-8B-Base-SFT-Uni-DPO-v2-Qwen
Text Generation
• 8B • Updated
• 37
• 1
psp-dada/Llama-3-8B-Base-SFT-Uni-DPO
Text Generation
• 8B • Updated
• 18
• 1
psp-dada/Llama-3-8B-Instruct-Uni-DPO-v2-ArmoRM
Text Generation
• 8B • Updated
• 35
• 1
psp-dada/Llama-3-8B-Instruct-Uni-DPO-v2-GPT-4o
Text Generation
• 8B • Updated
• 15
• 1
psp-dada/Qwen2.5-7B-Uni-DPO
Text Generation
• 8B • Updated
• 20
• 1
psp-dada/Llama-3-8B-Instruct-Uni-DPO
Text Generation
• 8B • Updated
• 17
• 1