Edit Models filters

Model Tree

Apps

Docker Model Runner

Inference Providers

OVHcloud AI Endpoints

HF Inference API

Misc

Inference Endpoints

text-generation-inference

Eval Results (legacy)

text-embeddings-inference

4-bit precision

8-bit precision

Mixture of Experts

Carbon Emissions

Models

25

Base only

Active filters: low-vram

Patil/krea-turbo-svdquant

Text-to-Image • Updated 5 days ago • 36 • 4

liodon-ai/Qwable-3.6-27b-imatrix-GGUF

Text Generation • 27B • Updated 9 days ago • 720 • 3

realrebelai/LOW_VRAM_Workflows

Text-to-Image • Updated 9 days ago • 13

CENSORED666/scenema-audio-low-vram-mode

Text-to-Speech • Updated May 17 • 5 • 3

TediumDispenser/forge-vram-offload-optimizer

Updated Mar 15, 2025

DevParker/VibeVoice7b-low-vram

Text-to-Speech • Updated Oct 23, 2025 • 71

MyeongHo0621/eeve-vss-smh-bnb-4bit

Text Generation • 11B • Updated Oct 11, 2025 • 2

The-frizzy1/Wan22ANIMATE

Video-to-Video • Updated Mar 24 • 8

ogiwrghs/Phi-3-medium-128k-instruct-GGUF

Text Generation • 14B • Updated Jan 11 • 23

Adlanecod/XTTS-v2-Lite-1gb

Text-to-Speech • Updated Dec 27, 2025

The-frizzy1/LTX2-GGUF-workflow

Image-to-Video • Updated Mar 24 • 6

cellrepair-systems/cellrepair-systems

Srikri7/qwen3.5-2b-reasoning

Text Generation • Updated Mar 21 • 1

The-frizzy1/Wan21-GGUF-4GB-Workflow

Image-to-Video • Updated Mar 24 • 1

The-frizzy1/Wan22-T2V-I2V-LORA-4GB

Image-to-Video • Updated Mar 24

The-frizzy1/Qwen-Image-Edit-2509-GGUF

Text-to-Image • Updated Mar 24

The-frizzy1/Flux-Kontext-GGUF-4GB

Text-to-Image • Updated Mar 24 • 1

The-frizzy1/Hunyuan-Video-Low-VRAM-4GB

Image-to-Video • Updated Mar 24 • 1

kizuna-intelligence/tsukuyomichan-omnivoice-compressed

Text Generation • 0.3B • Updated Apr 5 • 5

codeShare/FLUX.2-klein-AIO-SDNQ-4bit-dynamic

Text-to-Image • Updated May 10 • 14 • 3

eateggs0989/eateggsAI-30M

Text Generation • Updated May 19

sharp-y/realvisxl_v5.0_turbo_fp8

Text-to-Image • Updated May 27

SyncreticAI/dreamlite-comfyui-lowvram

Updated 22 days ago

kavenmartinez/Qwen3.6-27B-IQ2_XXS-webgpu-GGUF

Text Generation • 27B • Updated 16 days ago • 379

liodon-ai/gemma-4-12B-it-imatrix-GGUF

Text Generation • 12B • Updated 11 days ago • 453 • 1