Active filters: simpo
radm/forerunner-qwen32b-simpo-awq
Text Generation
• 33B • Updated • 3
Text Generation
• 8B • Updated • 4
AIR-hl/Qwen2.5-1.5B-SimPO
Text Generation
• 2B • Updated • 6
yakazimir/simpo-exps_qwen05b
Text Generation
• 0.5B • Updated • 33
Sean13/mistral-7b-instruct-v0.2-rsimpo-full
Text Generation
• 7B • Updated • 4
Boko99/llama3-instruct-simpo
Text Generation
• 266k • Updated • 2
Text Generation
• 266k • Updated • 2
Sean13/mistral-7b-instruct-v0.2-simpo-full
Text Generation
• 7B • Updated • 6
Sean13/llama-8b-instruct-simpo-full
Text Generation
• 8B • Updated • 6
Sean13/llama-8b-instruct-rsimpo-full
Text Generation
• 8B • Updated • 2
Text Generation
• 9B • Updated • 3
jz666/simpo-train-large-correct
Text Generation
• 9B • Updated • 1
jz666/simpo-train-largest-30-ppl-rejected
Text Generation
• 9B • Updated • 3
jz666/simpo-train-largest-30-ppl-chosen
Text Generation
• 9B • Updated • 2
jz666/simpo-train-largest-30-abs-diff
Text Generation
• 9B • Updated • 3
jz666/simpo-train-smallest-30-abs-diff
Text Generation
• 9B • Updated • 2
jz666/simpo-train-small-correct
Text Generation
• 9B • Updated • 2
jz666/simpo-train-small-wrong
Text Generation
• 9B • Updated • 2
jz666/simpo-train-filtered-full
Text Generation
• 9B • Updated • 1
jz666/simpo-train-large-wrong
Text Generation
• 9B • Updated • 5
jz666/gemma-2-9b-it-simpo-split-10-train_filtered_full
Text Generation
• 9B • Updated • 3
jz666/gemma-2-9b-it-dpo-train_filtered_full
Text Generation
• 9B • Updated • 2
Sean13/mistral-7b-instruct-v0.2-simpo-full-label_smoothing-0.1
Text Generation
• 266k • Updated • 2
Sean13/llama-8b-instruct-simpo-full-label_smoothing-0.1
Text Generation
• 266k • Updated • 1
Text Generation
• 3B • Updated • 7 • 1
mradermacher/Quanta-X-3B-GGUF
3B • Updated • 81 • 1
mradermacher/Quanta-X-3B-i1-GGUF
3B • Updated • 112 • 1
tomofusa/exp020-simpo-merged
Text Generation
• 4B • Updated • 6
nbeerbower/Huihui-Qwen3.5-9B-abliterated-Grimoire-SimPO
Text Generation
• 9B • Updated • 20