Edit Models filters

Model Tree

Apps

Docker Model Runner

Inference Providers

OVHcloud AI Endpoints

HF Inference API

Misc

Inference Endpoints

text-generation-inference

Eval Results (legacy)

text-embeddings-inference

4-bit precision

8-bit precision

Mixture of Experts

Carbon Emissions

Models

14

Base only

Active filters: rlm

ParamTatva/rlm-small-v1

Text Generation • Updated Feb 9 • 8

omar81939/rl4rlm-sft

Text Generation • Updated Mar 3

omar81939/rl4rlm-star

Text Generation • Updated Mar 3

omar81939/rl4rlm-dpo

Text Generation • Updated Mar 3

omar81939/rl4rlm-grpo-v4

Text Generation • Updated Mar 3

omar81939/rlm-qwen35-35b-a3b

Text Generation • Updated Mar 15 • 4 • 4

zake7749/gemma-4-31B-it-chinese-reasoning-preview-e1

Text Generation • 31B • Updated May 9 • 79 • 2

mit-oasys/rlm-qwen3-30b-a3b-v0.1

Text Generation • Updated May 27 • 73 • 11

lsteno/Qwen3-4B-Instruct-2507-RLM-RLVR-depth2-recursive-r64-a128-lr1e-5-adapter

Reinforcement Learning • Updated 20 days ago • 23

gnosis-lm/Gnosis-MedPolicy-12B

Text Generation • 12B • Updated 2 days ago • 371

gnosis-lm/Gnosis-MedPolicy-12B-GGUF

Text Generation • 12B • Updated 2 days ago • 317

mradermacher/Gnosis-MedPolicy-12B-GGUF

12B • Updated 1 day ago • 331

simonts/genre2-rlm-ntl-was-kbss-partial

Updated 4 days ago

gnosis-lm/Gnosis-MedPolicy-12B-W4A16

Text Generation • 12B • Updated 2 days ago • 149