Edit Models filters

Model Tree

Apps

Docker Model Runner

Inference Providers

OVHcloud AI Endpoints

HF Inference API

Misc

Inference Endpoints

text-generation-inference

Eval Results (legacy)

text-embeddings-inference

4-bit precision

8-bit precision

Mixture of Experts

Carbon Emissions

Models

59

Base only

Active filters: triton

mradermacher/IndustrialCoder-i1-GGUF

32B • Updated Mar 24 • 389 • 5

Multilingual-Multimodal-NLP/IndustrialCoder-Thinking

Text Generation • 32B • Updated Mar 26 • 37 • 6

YMRohit/ouroboros-kernelsmith-minicpm5-1b-GGUF

Text Generation • 1B • Updated 14 days ago • 166 • 1

wwerkk/tiny-audio-diffusion-percussion-finetuned-triton

Updated Aug 4, 2023 • 2

compressa-ai/Saiga-Llama-3-8B-OmniQuant

Text Generation • 8B • Updated Apr 23, 2024 • 6

compressa-ai/Llama-3-8B-Instruct-OmniQuant

Text Generation • 8B • Updated Apr 27, 2024 • 5

compressa-ai/Saiga-Llama-3-8B-AdaQRound

Text Generation • 8B • Updated Apr 27, 2024 • 5

compressa-ai/Llama-3-70B-Instruct-OmniQuant

Text Generation • 71B • Updated May 2, 2024 • 3

cdreetz/kwen2.5-1.5b

Text Generation • 2B • Updated Jun 4, 2025 • 13 •

cdreetz/kwen2.5-1.5b-v2

Text Generation • 2B • Updated Jul 17, 2025 • 2

Teen-Different/Qwen2.5-Coder-3B-KernelBook-Finetuned

3B • Updated Aug 1, 2025 • 3 • 5

edwixx/qwen3-8b-triton-finetune

Text Generation • 8B • Updated 5 days ago • 29

ykae/monarch-bert-base-mnli-hybrid

Text Classification • 82.2M • Updated Jan 19 • 4

ykae/monarch-bert-base-mnli

Text Classification • 54.9M • Updated Jan 25 • 3

Infatoshi/kernrl-training

Reinforcement Learning • Updated Jan 20

raipolymath/triton-windows

Updated Jan 23 • 1

hkust-nlp/drkernel-8b

Text Generation • 8B • Updated Feb 6 • 1.06k • • 4

hkust-nlp/drkernel-8b-coldstart

Text Generation • 0.3B • Updated Feb 6 • 99 •

hkust-nlp/drkernel-14b-coldstart

Text Generation • 0.5B • Updated Feb 6 • 611

hkust-nlp/drkernel-14b

Text Generation • 15B • Updated Feb 6 • 387 • 6

mradermacher/drkernel-8b-GGUF

Reinforcement Learning • 8B • Updated Feb 6 • 58 • 1

mradermacher/drkernel-8b-i1-GGUF

Reinforcement Learning • 8B • Updated Feb 6 • 167 • 1

mradermacher/drkernel-14b-GGUF

Reinforcement Learning • 15B • Updated Feb 7 • 20 • 2

mradermacher/drkernel-14b-i1-GGUF

Reinforcement Learning • 15B • Updated Feb 7 • 63 • 1

Joysulem/FireEcho

Text Generation • Updated Feb 17 • 3

YiyingXie/gemma-3-270m-it-dpo

Updated Mar 11 • 1

yo9otatara/prebuilt_wheels

Updated Mar 2 • 3

Multilingual-Multimodal-NLP/IndustrialCoder

Text Generation • 32B • Updated Mar 27 • 109 • 65

Multilingual-Multimodal-NLP/IndustrialCoder-Base

Text Generation • 32B • Updated Mar 26 • 9 • 3

mradermacher/IndustrialCoder-Base-GGUF

32B • Updated Mar 26 • 138