Edit Models filters

Model Tree

Apps

Docker Model Runner

Inference Providers

OVHcloud AI Endpoints

HF Inference API

Misc

Inference Endpoints

text-generation-inference

Eval Results (legacy)

text-embeddings-inference

4-bit precision

8-bit precision

Mixture of Experts

Carbon Emissions

Models

1,530

Base only

Active filters: alignment

Hi-Satoh/adv_sft3_dpo_merged

Text Generation • 4B • Updated Feb 20 • 3

arata1/dpo-qwen-cot-merged-0211-b03

Text Generation • 4B • Updated Feb 20 • 2

SKOTK/dpo-qwen-cot-merged

Text Generation • 4B • Updated Feb 27 • 19 •

m-fujita/structured-output-lora

Text Generation • 4B • Updated Feb 23 • 3

nyannto/dpo-qwen-cot-merged13

Text Generation • 4B • Updated Feb 21 • 17 •

Atsumi1/dpo-qwen-cot-merged-v5

Text Generation • 4B • Updated Feb 21 • 4

taka104/qwen3-4b-dpo-qwen-cot-merged

Text Generation • 4B • Updated Mar 2 • 19 •

shotalab/Qwen3-4B-Instruct-SFT-03-Merged-DPO-01

Text Generation • 4B • Updated Feb 21 • 18 •

q-hisa/dpo-qwen-cot-merged-v5

Text Generation • 4B • Updated Feb 22 • 17 •

q-hisa/dpo-qwen-cot-lora-v5

Text Generation • Updated Feb 21 • 2

takami2022/dpo-qwen3-4b-structured-v2_SFT_SystemPrompt

Text Generation • 4B • Updated Feb 21 • 2

takami2022/dpo-qwen3-4b-structured-v3_SFT_SystemPrompt

Text Generation • 4B • Updated Feb 21 • 3

sfutenma/dpo-qwen3_4b-cot-merged_v260221-210728

Text Generation • 4B • Updated Feb 21 • 2

nyannto/dpo-qwen-cot-merged15

Text Generation • 4B • Updated Feb 21 • 2

sfutenma/dpo-qwen3_4b-cot-merged_v260221-223020

Text Generation • 4B • Updated Feb 21 • 2

arata1/dpo-qwen-cot-e2-b05-1024

Text Generation • 4B • Updated Feb 21 • 21 •

kedumerikugame/dpo-qwen-cot-merged

Text Generation • 4B • Updated Feb 23 • 18 •

shotalab/Qwen3-4B-Instruct-SFT-03-Merged-DPO-02

Text Generation • 4B • Updated Feb 21 • 3

biokrhr/dpo-qwen-cot-merged

Text Generation • 4B • Updated Feb 21 • 4

gimlet09/dpo-qwen-cot-merged-v5

Text Generation • Updated Feb 21

Taichi11/sft_v7_dpo_v1_merged

Text Generation • 4B • Updated Feb 21 • 2

Orifusa/dpo-qwen-cot-merged_study11.5.1ya-cot

Text Generation • 4B • Updated Feb 22 • 3

makotonlo/LLM2026_DPO_SFT19_v8

Text Generation • Updated Feb 22 • 2

Orifusa/dpo-qwen-cot-merged_study11.5.3ya

Text Generation • 4B • Updated Feb 22 • 3

Hi-Satoh/adv_sft3J_dpo_merged

Text Generation • 4B • Updated Feb 22 • 18 •

Taichi11/sft_v7_dpo_v2_merged

Text Generation • 4B • Updated Feb 27 • 18 •

WassyO/qwen3-4b-instruct-dpo-from-sft-4096_u-10bei_v5

Text Generation • Updated Feb 22 • 2

Orifusa/dpo-qwen-cot-merged_study11.5.4ya

Text Generation • 4B • Updated Feb 22 • 3

shinich001/dpo-qwen-cot-merged

Text Generation • 4B • Updated Feb 22 • 20 •

ogwata/exp19-enhanced-dpo

Text Generation • 4B • Updated Feb 22 • 8