Active filters: alignment
36n9/Vehuiah-Draco-20260425_053603
36n9/Vehuiah-Draco-20260425_053637
36n9/Vehuiah-Draco-20260425_053712
36n9/Vehuiah-Draco-20260425_053747
36n9/Vehuiah-Draco-20260425_053822
36n9/Vehuiah-Draco-20260425_053857
36n9/Vehuiah-Draco-20260425_053933
36n9/Vehuiah-Draco-20260425_054008
36n9/Vehuiah-Draco-20260425_054044
36n9/Vehuiah-Draco-20260425_054127
36n9/Vehuiah-Draco-20260425_054202
36n9/Vehuiah-Draco-20260425_054238
36n9/Vehuiah-Draco-20260425_054312
36n9/Vehuiah-Draco-20260425_054347
36n9/Vehuiah-Draco-20260425_054423
36n9/Vehuiah-Draco-20260425_054459
Barryzbr12/qwen2.5-7b-instruct-dpo-lima-lora
Updated • 40
Flink-ddd/MoE-Pilot-Align-2.7B
Text Generation • 14B • Updated • 1.07k
Shiggii/qwen-incident-response-grpo
Text Generation • Updated • 89
mradermacher/MoE-Pilot-Align-2.7B-GGUF
14B • Updated • 677
mradermacher/MoE-Pilot-Align-2.7B-i1-GGUF
14B • Updated • 2.25k • 1
cbyabush/gpt2-dpo-ultrafeedback
Text Generation • 0.1B • Updated • 41
mradermacher/Gemopus-4-31B-it-GGUF
31B • Updated • 969
xypkent/assignment4-qwen25-7b-dpo-adapter
Updated • 35
kmseong/llama3_2_3b-instruct-SSFT-lr5e-5
Text Generation • 3B • Updated • 421
kmseong/llama3_2_3b-instruct-WaRP_lr5e-5
Text Generation • 3B • Updated • 354
zqmalyssa/Qwen2.5-1.5B-Assistant
Text Generation • 2B • Updated • 699
kmseong/llama3_2_3b-instruct-WaRP_lr3e-5
Text Generation • 3B • Updated • 349
ChenLingD/Psy-Qwen-DPO-LoRA
Text Generation • Updated • 14