Inference Providers
Active filters: fp16
AIFunOver/sdxl-turbo-openvino-fp16
Text-to-Image
• Updated AIFunOver/FLUX.1-dev-openvino-fp16
Text-to-Image
• Updated AIFunOver/glm-4-9b-chat-1m-openvino-fp16
AIFunOver/gemma-2-2b-it-openvino-fp16
Text Generation
• Updated • 5
AIFunOver/chatglm3-6b-openvino-fp16
AIFunOver/glm-4-9b-chat-openvino-fp16
AIFunOver/stable-diffusion-3.5-large-turbo-openvino-fp16
Text-to-Image
• Updated • 14
• 1
AIFunOver/all-MiniLM-L6-v2-openvino-fp16
Sentence Similarity
• Updated • 9
AIFunOver/all-mpnet-base-v2-openvino-fp16
Sentence Similarity
• Updated • 6
AIFunOver/OpenCoder-8B-Instruct-openvino-fp16
Text Generation
• Updated • 11
AIFunOver/OpenCoder-1.5B-Instruct-openvino-fp16
Text Generation
• Updated • 5
AIFunOver/Qwen2.5-Coder-14B-Instruct-openvino-fp16
Text Generation
• Updated • 7
AIFunOver/Llama-Guard-3-1B-openvino-fp16
Text Generation
• Updated • 6
AIFunOver/DRT-o1-7B-openvino-fp16
Text Generation
• Updated • 7
AIFunOver/DRT-o1-14B-openvino-fp16
Text Generation
• Updated • 6
AIFunOver/Falcon3-1B-Instruct-openvino-fp16
Text Generation
• Updated • 8
AIFunOver/Falcon3-3B-Instruct-openvino-fp16
Text Generation
• Updated • 6
AIFunOver/Falcon3-7B-Instruct-openvino-fp16
Text Generation
• Updated • 7
AIFunOver/Falcon3-10B-Instruct-openvino-fp16
Text Generation
• Updated • 9
AIFunOver/phi-4-openvino-fp16
Text Generation
• Updated • 13
• 1
AIFunOver/Qwen2.5-7B-Instruct-1M-openvino-fp16
Text Generation
• Updated • 7
yacht/byt5-base-en2th-transliterator
Text Generation
• 0.6B • Updated • 1.73k
saishshinde15/Clyrai_Vortex_GGUF
3B • Updated • 20
Hjgugugjhuhjggg/Jaja-small-mlx-3Bit
Text Generation
• 12.6M • Updated • 1
Hjgugugjhuhjggg/Jaja-small-2bit-mlx
Text Generation
• 16.8M • Updated • 1
Hjgugugjhuhjggg/Jaja-small-q2-mlx
Text Generation
• 16.8M • Updated • 2
Hjgugugjhuhjggg/small_tiny_v1-q2-mlx
Text Generation
• 1.62M • Updated • 3
jian-mo/jina-reranker-m0-onnx
Sentence Similarity
• Updated • 2
Fulstac/deepseek-r1-Distill-Qwen-32B-sqlgen-4bit-v1
Text Generation
• 33B • Updated • 2
Fulstac/deepseek-r1-Distill-Qwen-32B-lora-4bit-v3
Text Generation
• 33B • Updated • 1