Active filters: emerald
llmware/slim-extract-qwen-0.5b-ov
Updated • 4
• 1
llmware/llama-11b-vision-instruct-ov
Updated • 8
• 2
llmware/qwen2-vl-2b-instruct-ov
Updated • 12
• 2
llmware/qwen2-vl-7b-instruct-ov
Updated • 3
• 1
llmware/llama-3.2-1b-instruct-onnx
Updated • 9
• 2
llmware/phi-3-vision-onnx
llmware/llama-3.2-1b-gguf
1B • Updated • 28
• 1
3B • Updated • 22
llmware/qwen2.5-7b-coder-gguf
8B • Updated • 76
Updated • 35
• 1
llmware/deepseek-qwen-14b-gguf
15B • Updated • 44
• 1
llmware/deepseek-qwen-7b-gguf
8B • Updated • 36
• 1
4B • Updated • 19
llmware/qwen2-1.5b-instruct-gguf
2B • Updated • 4
0.5B • Updated • 2
4B • Updated • 8
llmware/qwen-2.5-14b-instruct-gguf
15B • Updated • 20
llmware/gemma-2-9b-instruct-gguf
9B • Updated • 120
• 1
llmware/llama-3.2-3b-onnx-qnn
Updated • 15
• 1
33B • Updated • 8
llmware/mistral-7b-instruct-v0.3-gguf
7B • Updated • 30
llmware/gemma-2-27b-instruct-gguf
27B • Updated • 77
4B • Updated • 83
• 1
llmware/slim-sentiment-npu-ov
llmware/llama-3.2-1b-instruct-npu-ov
llmware/llama-3.2-3b-instruct-npu-ov
Updated • 31
llmware/slim-emotions-npu-ov
llmware/slim-extract-tiny-npu-ov
Updated • 11