Edit Models filters

Model Tree

Apps

Docker Model Runner

Inference Providers

OVHcloud AI Endpoints

HF Inference API

Misc

Inference Endpoints

text-generation-inference

Eval Results (legacy)

text-embeddings-inference

4-bit precision

8-bit precision

Mixture of Experts

Carbon Emissions

Models

23

Base only

Active filters: rtn

TrNi/efficient-cube3d

Text-to-3D • Updated 6 days ago • 6

operationrange/MiniMax-M2.7-8bit

Text Generation • 60B • Updated May 5 • 142 • 1

quantpa/Qwen__Qwen3-4B-Thinking-2507_RTN_w3g128

4B • Updated Jan 21 • 1

quantpa/Qwen__Qwen3-4B-Thinking-2507_RTN_w4g128

4B • Updated Jan 21 • 1

quantpa/Qwen__Qwen3-30B-A3B-Thinking-2507_RTN_w3g128

31B • Updated Jan 21 • 1

quantpa/Qwen__Qwen3-30B-A3B-Thinking-2507_RTN_w4g128

31B • Updated Jan 21 • 2

quantpa/microsoft__Phi-4-reasoning-plus_RTN_w3g128

15B • Updated Jan 21 • 3

quantpa/microsoft__Phi-4-reasoning-plus_RTN_w4g128

15B • Updated Jan 21 • 3

quantpa/zai-org__GLM-Z1-9B-0414_RTN_w3g128

9B • Updated Jan 21 • 2

quantpa/zai-org__GLM-Z1-9B-0414_RTN_w4g128

9B • Updated Jan 21 • 2

tonimartir/gpt-oss-20b-onnx-cuda-rtn-gpu

Text Generation • Updated Feb 9 • 3

tonimartir/gpt-oss-20b-onnx-generic-rtn-cpu

Text Generation • Updated Feb 9

necroyancer/gemma-4-31B-it-NVFP4-turbo-vision

Image-Text-to-Text • 33B • Updated Apr 26 • 18.6k • 4

nerkyor/Qwen3.6-35B-A3B-NVFP4-v8-RTN

Text Generation • Updated May 11 • 154 • 1

nerkyor/Qwen3.6-27B-NVFP4-v8-RTN

Text Generation • 16B • Updated May 11 • 21 • 1

88plug/Nemotron-3-Nano-30B-A3B-W4A16

Text Generation • 33B • Updated 2 days ago • 266

morriszjm/Qwen3-30B-A3B-RTN-W4A16-g128

Text Generation • 31B • Updated Jun 4 • 1

vio1ator/Nex-N2-mini-FP8-RTN

35B • Updated 28 days ago • 4.96k

WaveCut/Qwopus3.6-27B-Coder-FP8-W4A16-G64-RTN-vllm

Image-Text-to-Text • 6B • Updated 21 days ago • 699 • 2

qwertz92/ibm-granite-speech-4.1-2b-nar-q4-rtn-onnx

Automatic Speech Recognition • Updated 11 days ago

casperhansen/Qwen3.6-35B-A3B-INT4-RTN

Image-Text-to-Text • 37B • Updated 4 days ago • 22

sahilchachra/Agents-A1-W4A16

Text Generation • 35B • Updated 3 days ago • 316

sahilchachra/Laguna-XS-2.1-W4A16

Text Generation • 34B • Updated 2 days ago • 317 • 1