Active filters: torchao
vymenets/yv-llama-quantized
Text Generation
• Updated • 6
jerryzh168/gemma3-4b-it-int4wo
Image-Text-to-Text
• Updated • 4
jerryzh168/gemma3-4b-it-int4wo-hqq
Image-Text-to-Text
• Updated • 3
Text Generation
• Updated • 5
medmekk/Llama-3.2-1B-torchao-int8wo-gs128
medmekk/Llama-3.2-1B-ao-autoquant
medmekk/Llama-3.2-1B-ao-int8wo-gs128
medmekk/Llama-3.2-1B-ao-int8wo
Text Generation
• Updated • 6
medmekk/Llama-3.2-1B-ao-int8da8w
Text Generation
• Updated • 6
medmekk/Llama-3.2-1B-ao-int8wo-gs16
Text Generation
• Updated • 6
medmekk/Llama-3.2-1B-ao-int8wo-gs32
Text Generation
• Updated • 5
medmekk/Qwen2.5-0.5B-Instruct-ao-int8wo-gs128
Text Generation
• Updated • 6
jerryzh168/phi4-int4wo-gptq
Text Generation
• Updated • 5
medmekk/Qwen2.5-0.5B-Instruct-ao-int8da8w
Text Generation
• Updated • 7
jerryzh168/phi4-mini-8da4w
Text Generation
• Updated • 6
RoadToNowhere/Qwen2.5-QwQ-35B-Eureka-Cubed-abliterated-uncensored-int8wo-g128
Text Generation
• Updated • 5
jerryzh168/phi4-int4wo-hqq
Text Generation
• Updated • 5
jerryzh168/phi4-torchao-gguf-q4_k
Text Generation
• Updated • 6
Novaciano/SEX_ROLEPLAY-3.2-1B-ao-int4wo-gs128
Text Generation
• Updated • 48
Novaciano/SEX_ROLEPLAY-3.2-1B-ao-int8wo-gs128
Text Generation
• Updated • 52
Novaciano/SEX_ROLEPLAY-3.2-1B-ao-int8da8w
Text Generation
• Updated • 104
• 1
pytorch/Phi-4-mini-instruct-INT8-INT4
Text Generation
• Updated • 3.15k
• 2
jerryzh168/phi4-mini-torchao-gguf-q4_k
Text Generation
• Updated • 5
pytorch/Phi-4-mini-instruct-INT4
Text Generation
• Updated • 60
pytorch/Phi-4-mini-instruct-FP8
Text Generation
• Updated • 2.32k
• 1
jerryzh168/phi4-mini-torchao-ar-gguf-q4_k
Text Generation
• Updated • 6
tachyphylaxis/MegaMaid_123b-torchao-int8_dynamic_activation_int8_weight
jerryzh168/phi4-mini-int4wo-gemlite
Text Generation
• Updated • 8
medmekk/Llama-3.2-1B-ao-float8wo
Text Generation
• Updated • 5