Models

501

Full-text search

Active filters: rlhf

mradermacher/ToxicHermes-2.5-Mistral-7B-GGUF

7B • Updated Nov 16, 2024 • 10

mradermacher/ToxicHermes-2.5-Mistral-7B-i1-GGUF

7B • Updated Nov 16, 2024 • 123

mradermacher/OrpoLlama-3-8B-GGUF

8B • Updated Nov 17, 2024 • 131

mradermacher/OrpoLlama-3-8B-i1-GGUF

8B • Updated Nov 17, 2024 • 211

tensorblock/Llama-3-70B-Orpo-v0.1-GGUF

71B • Updated Jan 27 • 1

hfc971/NeuralBeagle14-7B-GGUF

Updated Dec 14, 2024

arcticoneai/Arctic_AI

Reinforcement Learning • Updated Nov 9, 2025 • 46 • 2

tensorblock/distilabeled-Marcoro14-7B-slerp-full-GGUF

7B • Updated Jan 27 • 9

mradermacher/distilabeled-Marcoro14-7B-slerp-full-GGUF

7B • Updated Dec 19, 2024 • 35 • 1

tensorblock/NeuralMarcoro14-7B-GGUF

7B • Updated Jan 27 • 2

mradermacher/distilabeled-Marcoro14-7B-slerp-full-i1-GGUF

7B • Updated Dec 19, 2024 • 129 • 1

mradermacher/distilabeled-Marcoro14-7B-slerp-GGUF

7B • Updated Dec 19, 2024 • 25

mradermacher/pandora-7b-chat-GGUF

9B • Updated Dec 24, 2024 • 97

mradermacher/pandora-7b-chat-i1-GGUF

9B • Updated Jan 26, 2025 • 235

tensorblock/NeuralHermes-2.5-Mistral-7B-GGUF

7B • Updated Jan 27 • 9

tensorblock/archangel_sft-dpo_pythia2-8b-GGUF

3B • Updated Jan 27 • 5

tensorblock/archangel_sft_llama7b-GGUF

7B • Updated Jan 27 • 9

tensorblock/archangel_sft-kto_llama13b-GGUF

13B • Updated Jan 27 • 7

mradermacher/UpshotLlama-3-8B-GGUF

8B • Updated Jul 31, 2025 • 80

mradermacher/Llama-3-8B-Orpo-v0.1-GGUF

8B • Updated Jul 11, 2025 • 90

mradermacher/Llama-3-8B-Orpo-v0.1-i1-GGUF

8B • Updated Jul 11, 2025 • 397

ZeppelinCorp/Okamela

Text Generation • Updated Apr 14, 2025 • 2

bikmish/llm-course-hw2-dpo

0.1B • Updated Mar 29, 2025 • 1

mradermacher/beaver-7b-v2.0-GGUF

Reinforcement Learning • 7B • Updated Jul 11, 2025 • 189

mradermacher/beaver-7b-v3.0-GGUF

Reinforcement Learning • 7B • Updated Jul 11, 2025 • 75 • 1

mradermacher/beaver-7b-v1.0-GGUF

Reinforcement Learning • 7B • Updated Jul 11, 2025 • 60

loganlin777/mistral-7b-dpo-adapter

Updated Apr 27, 2025

VilaVision/dentalmisalignmentdetection

Image Classification • Updated Jul 21, 2025 • 2

tensorblock/mlabonne_NeuralDaredevil-7B-GGUF

7B • Updated Jan 27 • 12

BryanADA/Qwen2.5-3B-cot-zh-tw

Text Generation • 3B • Updated May 23, 2025 • 16 • 1