Models

531

Full-text search

Active filters: RLHF

RMSnow/SpeechJudge-BTRM

Updated Dec 23, 2025 • 2

cherifkhalifah/Llama3-OpenBioLLM-8B

Updated Dec 23, 2025 • 2

kunjcr2/stablelm-1.6b-finetuned-aligned

Text Generation • Updated Jan 28 • 1

Supreeth/searchlm-qwen2.5-3b-rlhf

Text Generation • 3B • Updated Jan 31 • 1

kanishkez/Reward-Model

Text Classification • 3B • Updated Feb 4 • 6 • 1

gopihc/Llama3-OpenBioLLM-8B

Updated Feb 11 • 3

psp-dada/Llama-3-8B-Base-SFT-Uni-DPO-v2-GPT-4

Text Generation • 8B • Updated Feb 22 • 3 • 1

psp-dada/Gemma2-9B-IT-Uni-DPO

Text Generation • 9B • Updated Feb 22 • 13 • 1

psp-dada/Llama-3-8B-Base-SFT-Uni-DPO-v2-Qwen

Text Generation • 8B • Updated Feb 22 • 6 • 1

psp-dada/Llama-3-8B-Base-SFT-Uni-DPO

Text Generation • 8B • Updated Feb 22 • 5 • 1

psp-dada/Llama-3-8B-Instruct-Uni-DPO-v2-ArmoRM

Text Generation • 8B • Updated Feb 22 • 7 • 1

psp-dada/Llama-3-8B-Instruct-Uni-DPO-v2-GPT-4o

Text Generation • 8B • Updated Feb 22 • 12 • 1

psp-dada/Qwen2.5-7B-Uni-DPO

Text Generation • 8B • Updated Feb 22 • 6 • 1

psp-dada/Llama-3-8B-Instruct-Uni-DPO

Text Generation • 8B • Updated Feb 22 • 7 • 1

psp-dada/Qwen2.5-Math-7B-Uni-DPO

Text Generation • 8B • Updated Feb 22 • 6 • 1

Harmony18090/openbiollm-model

Updated Feb 25 • 5

mradermacher/openbiollm-model-GGUF

8B • Updated Feb 26 • 27

mradermacher/openbiollm-model-i1-GGUF

8B • Updated Feb 26 • 84

Knoxwhite77/Llama3-OpenBioLLM-8B

Updated 22 days ago • 14

FaolanKusibo/Starling-LM-7B-alpha-Q4_K_M-GGUF

7B • Updated 13 days ago • 51

LiLinaamari/Llama3-OpenBioLLM-8B

Updated 5 days ago • 47