Edit Models filters

Model Tree

Apps

Docker Model Runner

Inference Providers

OVHcloud AI Endpoints

HF Inference API

Misc

Inference Endpoints

text-generation-inference

Eval Results (legacy)

text-embeddings-inference

4-bit precision

8-bit precision

Mixture of Experts

Carbon Emissions

Models

1,530

Base only

Active filters: alignment

q-hisa/dpo-qwen-cot-merged-v6

Text Generation • 4B • Updated Feb 22 • 3

q-hisa/dpo-qwen-cot-lora-v6

Text Generation • Updated Feb 22 • 3

Orifusa/dpo-qwen-cot-merged_study11.5.5ya

Text Generation • 4B • Updated Feb 22 • 3

Orifusa/dpo-qwen-cot-merged_study11.5.6ya

Text Generation • 4B • Updated Feb 22 • 3

makotonlo/LLM2026_DPO_SFT19_v9

Text Generation • Updated Feb 22 • 2

shinich001/qwen3-4b-seq2k-dpo-merged

Text Generation • 4B • Updated Feb 22 • 6

WassyO/qwen3-4b-instruct-dpo-from-sft-4096_u-10bei_9datasets

Text Generation • Updated Feb 22 • 2

nikeda/dpo-qwen-cot-merged

Text Generation • 4B • Updated Feb 22 • 3

sonodd/qwen3-4b-structeval-dpo-v1-base

Text Generation • 4B • Updated Feb 22 • 3

nagayoshi3/dpo-qwen-cot-merged_test1

Text Generation • 4B • Updated Feb 22 • 3

sen0808/dpo-qwen-cot-merged

Text Generation • 4B • Updated Feb 23 • 3

Hi-Satoh/adv_sft3J2_dpo_merged

Text Generation • 4B • Updated Feb 23 • 3

makotonlo/LLM2026_DPO_SFT19_v10

Text Generation • Updated Feb 23 • 2

takerufukushima/dpo-qwen-cot-merged

Text Generation • 4B • Updated Feb 23 • 4

sak3E/qwen2.5-1.5b-dpo-truthful

q-hisa/dpo-qwen-cot-merged-v7

Text Generation • 4B • Updated Feb 23 • 3

q-hisa/dpo-qwen-cot-lora-v7

Text Generation • Updated Feb 23 • 3

masachika/qwen3-4b-dpo-cot-merged

Text Generation • 4B • Updated Feb 24 • 17 •

OguraHiroyuki/dpo-qwen-cot-merged

Text Generation • 4B • Updated Feb 23 • 18 •

makotonlo/LLM2026_DPO_SFT19_v11

Text Generation • Updated Feb 23 • 2

makotonlo/LLM2026_DPO_SFT19_v12

Text Generation • Updated Feb 23 • 2

OguraHiroyuki/dpo-qwen-cot-mergedv2

Text Generation • 4B • Updated Feb 23 • 3

unfdn/qwen3-lora

Text Generation • 4B • Updated Mar 2 • 3

ykrh/dpo-qwen-cot-merged

Text Generation • 4B • Updated Feb 23 • 16 •

ToshiyaOg/dpo-qwen-cot-merged

Text Generation • 4B • Updated Feb 24 • 19 •

Bobby-wag/dpo-qwen-cot-merged1

Text Generation • 4B • Updated Feb 24 • 2

toshiyuki-kato/dpo-qwen-cot-merged

Text Generation • 4B • Updated Feb 24 • 15 •

OguraHiroyuki/dpo-qwen-cot-mergedv3

Text Generation • 4B • Updated Feb 24 • 1

seibergwitten/dpo-qwen-cot-merged.ver0

Text Generation • 4B • Updated Feb 24 • 17 •

OguraHiroyuki/dpo-qwen-cot-mergedv4

Text Generation • 4B • Updated Feb 24 • 21 •