-
-
-
-
-
-
Inference Providers
Active filters: simpo
radm/forerunner-qwen32b-simpo-awq
Text Generation
• 33B • Updated
• 9
Text Generation
• 8B • Updated
• 2
AIR-hl/Qwen2.5-1.5B-SimPO
Text Generation
• 2B • Updated
• 1
yakazimir/simpo-exps_qwen05b
Text Generation
• 0.5B • Updated
• 3
Sean13/mistral-7b-instruct-v0.2-rsimpo-full
Text Generation
• 7B • Updated
• 5
Boko99/llama3-instruct-simpo
Text Generation
• 266k • Updated
• 1
Text Generation
• 266k • Updated
• 1
Sean13/mistral-7b-instruct-v0.2-simpo-full
Text Generation
• 7B • Updated
Sean13/llama-8b-instruct-simpo-full
Text Generation
• 8B • Updated
• 1
Sean13/llama-8b-instruct-rsimpo-full
Text Generation
• 8B • Updated
Text Generation
• 9B • Updated
jz666/simpo-train-large-correct
Text Generation
• 9B • Updated
jz666/simpo-train-largest-30-ppl-rejected
Text Generation
• 9B • Updated
jz666/simpo-train-largest-30-ppl-chosen
Text Generation
• 9B • Updated
• 3
jz666/simpo-train-largest-30-abs-diff
Text Generation
• 9B • Updated
jz666/simpo-train-smallest-30-abs-diff
Text Generation
• 9B • Updated
jz666/simpo-train-small-correct
Text Generation
• 9B • Updated
jz666/simpo-train-small-wrong
Text Generation
• 9B • Updated
• 1
jz666/simpo-train-filtered-full
Text Generation
• 9B • Updated
jz666/simpo-train-large-wrong
Text Generation
• 9B • Updated
jz666/gemma-2-9b-it-simpo-split-10-train_filtered_full
Text Generation
• 9B • Updated
• 4
jz666/gemma-2-9b-it-dpo-train_filtered_full
Text Generation
• 9B • Updated
Sean13/mistral-7b-instruct-v0.2-simpo-full-label_smoothing-0.1
Text Generation
• 266k • Updated
Sean13/llama-8b-instruct-simpo-full-label_smoothing-0.1
Text Generation
• 266k • Updated
• 3
Text Generation
• 3B • Updated
• 14
• 1
mradermacher/Quanta-X-3B-GGUF
3B • Updated
• 631
• 1
mradermacher/Quanta-X-3B-i1-GGUF
3B • Updated
• 1.08k
• 1
tomofusa/exp020-simpo-merged
Text Generation
• 4B • Updated
• 36