Edit Models filters

Apps

Docker Model Runner

Inference Providers

OVHcloud AI Endpoints

HF Inference API

Misc

Inference Endpoints

text-generation-inference

Eval Results (legacy)

text-embeddings-inference

4-bit precision

8-bit precision

Mixture of Experts

Carbon Emissions

Models

544

Full-text search

Active filters: DPO

NamrataThakur/GPT2_355M_Perference-Fine-Tune_DPO

Question Answering • Updated 13 days ago

raniero/dpo_test_demo

Updated Aug 8, 2025

raniero/app

Updated Aug 10, 2025

mradermacher/InfiAlign-Qwen-7B-DPO-GGUF

8B • Updated Aug 13, 2025 • 112

mradermacher/InfiAlign-Qwen-7B-DPO-i1-GGUF

8B • Updated Dec 9, 2025 • 108 • 1

jorgedelpozolerida/Llama3-OpenBioLLM-8B-Q8_0-GGUF

8B • Updated Aug 29, 2025 • 5

prithivMLmods/ReasonFlux-Qwen3-dpo

Text Generation • 2B • Updated Sep 7, 2025 • 1

mradermacher/ReasonFlux-Qwen3-dpo-GGUF

2B • Updated Sep 9, 2025 • 46

mradermacher/ReasonFlux-Qwen3-dpo-i1-GGUF

2B • Updated Dec 25, 2025 • 55

mradermacher/OpenBioLLm-70B-GGUF

71B • Updated Sep 7, 2025 • 62

mradermacher/OpenBioLLm-70B-i1-GGUF

71B • Updated Dec 31, 2025 • 19

John6666/ntrmix-blessed-v11-dpo-sdxl

Text-to-Image • Updated Sep 11, 2025 • 3

SandLogicTechnologies/Hermes-2-Pro-Llama-3-8B-GGUF

Text Generation • 8B • Updated Sep 29, 2025 • 62

suayptalha/Sungur-9B-GGUF

Text Generation • 9B • Updated Oct 2, 2025 • 391 • 4

mradermacher/Sungur-9B-GGUF

9B • Updated Oct 2, 2025 • 11 • 1

mradermacher/Sungur-9B-i1-GGUF

9B • Updated Dec 11, 2025 • 141 • 1

invi-bhagyesh/TinyLlama-1.1B-Chat-v1.0-hh-rlhf

1B • Updated Nov 18, 2025

yukiarimo/yuna-ai-v1

Text Generation • 8B • Updated Nov 13, 2025 • 2

yukiarimo/yuna-ai-v2-miru

Text Generation • 11B • Updated Nov 13, 2025 • 2

cherifkhalifah/Llama3-OpenBioLLM-8B

Updated Dec 23, 2025

gopihc/Llama3-OpenBioLLM-8B

Updated 26 days ago • 16

psp-dada/Llama-3-8B-Base-SFT-Uni-DPO-v2-GPT-4

Text Generation • 8B • Updated 16 days ago • 18 • 1

psp-dada/Gemma2-9B-IT-Uni-DPO

Text Generation • 9B • Updated 16 days ago • 26 • 1

psp-dada/Llama-3-8B-Base-SFT-Uni-DPO-v2-Qwen

Text Generation • 8B • Updated 16 days ago • 38 • 1

psp-dada/Llama-3-8B-Base-SFT-Uni-DPO

Text Generation • 8B • Updated 16 days ago • 20 • 1

psp-dada/Llama-3-8B-Instruct-Uni-DPO-v2-ArmoRM

Text Generation • 8B • Updated 16 days ago • 36 • 1

psp-dada/Llama-3-8B-Instruct-Uni-DPO-v2-GPT-4o

Text Generation • 8B • Updated 16 days ago • 16 • 1

psp-dada/Qwen2.5-7B-Uni-DPO

Text Generation • 8B • Updated 16 days ago • 21 • 1

psp-dada/Llama-3-8B-Instruct-Uni-DPO

Text Generation • 8B • Updated 16 days ago • 18 • 1

psp-dada/Qwen2.5-Math-7B-Uni-DPO

Text Generation • 8B • Updated 16 days ago • 19 • 1