Active filters: DPO
Avibhi/Gemma2-2B-HindiTranslation-DPO
Updated
JHuel/Mistral-Nemo-Instruct-2407_DPO_qlora
Reinforcement Learning
• Updated
Text Generation
• 8B • Updated
• 11
• 4
mradermacher/Karasu-DPO-7B-GGUF
8B • Updated
• 45
• 1
mradermacher/Karasu-DPO-7B-i1-GGUF
8B • Updated
• 142
• 1
govindrhf/aaditya-Llama3-OpenBioLLM-70B
Updated
dhruvrnaik/test-openbiollm
Updated
mlx-community/DiscoLM_German_7b_v1-mlx
7B • Updated
• 29
• 1
chucre/Llama3-OpenBioLLM-70B
Updated
jachermann/DiscoLM_German_7b_v1_court_finetuned
7B • Updated
• 16
VaidikML0508/Shark-Tank-Offer-Evaluator-llama3.2-3B-Instruct-SFT-DPO-4bits-V1
Text Generation
• 3B • Updated
• 2
Writer/Palmyra-local-1_7B
estnafinema0/smolLM-variation-dpo
Text Generation
• 0.1B • Updated
• 1
mradermacher/KoBioMed-Llama-3.1-8B-Instruct-GGUF
8B • Updated
• 43
mradermacher/KoBioMed-Llama-3.1-8B-Instruct-i1-GGUF
8B • Updated
• 31
mradermacher/TC-instruct-DPO-GGUF
7B • Updated
• 39
ytu-ce-cosmos/Turkish-Gemma-9b-v0.1
Text Generation
• 9B • Updated
• 1.05k
• 34
Text Generation
• 1B • Updated
ALEXIOSTER/Humorous_DPO_LLama2_7b
Updated
tensorblock/tanamettpk_TC-instruct-DPO-GGUF
7B • Updated
• 84
tensorblock/DiscoResearch_DiscoLM_German_7b_v1-GGUF
7B • Updated
• 71
tensorblock/xDAN-AI_xDAN-L1-Chat-RL-v1-GGUF
7B • Updated
• 3
mradermacher/Turkish-Gemma-9b-v0.1-GGUF
9B • Updated
• 56
• 2
mradermacher/Turkish-Gemma-9b-v0.1-i1-GGUF
9B • Updated
• 298
• 1
ETI-Deploy/DM-BaseModel-4Bit
Text Generation
• 73B • Updated
• 2
Text Generation
• 9B • Updated
• 12
• 10
mario-rc/gemma-2-9b-it-emotional-rlaif-dpo
raniero/submission_dpo_ok_001
Updated
InfiX-ai/InfiAlign-Qwen-7B-DPO
Text Generation
• 8B • Updated
• 4
• 4