Inference Providers
Active filters: redhat
RedHatAI/Qwen3-8B-quantized.w4a16
Text Generation
• 2B • Updated • 10.9k
• 3
RedHatAI/Qwen3-30B-A3B-quantized.w4a16
Text Generation
• 5B • Updated • 2.49k
• 7
BCCard/Qwen3-32B-FP8-Dynamic
Text Generation
• 33B • Updated • 5
• 1
BCCard/Qwen3-30B-A3B-FP8-Dynamic
Text Generation
• 31B • Updated • 23.9k
Text Generation
• 15B • Updated • 72
• 1
RedHatAI/Llama-4-Maverick-17B-128E-Instruct-FP8
Image-Text-to-Text
• 402B • Updated • 181
• 2
RedHatTraining/AI296-m3diterraneo-hotels
8B • Updated • 36
• 1
RedHatAI/DeepSeek-R1-0528-quantized.w4a16
Text Generation
• 104B • Updated • 747
• 13
RedHatAI/Llama-4-Maverick-17B-128E-Instruct-quantized.w4a16
Image-Text-to-Text
• 59B • Updated • 409
• 1
Image-Text-to-Text
• 109B • Updated • 2
RedHatAI/Kimi-K2-Instruct-quantized.w4a16
Text Generation
• 1T • Updated • 568
• 12
nm-testing/Llama-3.1-8B-Instruct-speculator.eagle3-converted
Text Generation
• 1.0B • Updated • 211
RedHatAI/SmolLM3-3B-quantized.w4a16
0.9B • Updated • 23
• 1
Text-to-Image
• Updated • 5
RedHatAI/Devstral-Small-2507-FP8-Dynamic
Text Generation
• 24B • Updated • 30
• 4
RedHatAI/Devstral-Small-2507-quantized.w8a8
Text Generation
• 24B • Updated • 82
• 1
RedHatAI/Devstral-Small-2507-quantized.w4a16
Text Generation
• 4B • Updated • 24
• 2
RedHatAI/Qwen3-14B-speculator.eagle3
Text Generation
• 1B • Updated • 6.04k
RedHatAI/Qwen3-32B-speculator.eagle3
Text Generation
• 2B • Updated • 879
• 8
RedHatAI/Llama-3.3-70B-Instruct-speculator.eagle3
Text Generation
• 2B • Updated • 2.92k
• 1
RedHatAI/Llama-3.1-8B-Instruct-speculator.eagle3
Text Generation
• 1.0B • Updated • 24.8k
• 2
RedHatAI/Qwen3-8B-speculator.eagle3
Text Generation
• 1B • Updated • 73.4k
• 28
RedHatAI/Qwen3-235B-A22B-Instruct-2507-speculator.eagle3
Text Generation
• 1B • Updated • 811
ChibuUkachi/Qwen3-4B-Instruct-2507.w4a16
Text Generation
• 1B • Updated • 4
RedHatAI/Qwen3-4B-Thinking-2507-quantized.w4a16
Text Generation
• 4B • Updated • 245
RedHatAI/Qwen3-4B-Instruct-2507-quantized.w4a16
Text Generation
• 4B • Updated • 146
RedHatAI/Qwen3-30B-A3B-Thinking-2507-quantized.w4a16
Text Generation
• 5B • Updated • 109
RedHatAI/Qwen3-30B-A3B-Instruct-2507-quantized.w4a16
Text Generation
• 5B • Updated • 1.44k
• 1
RedHatAI/Qwen3-Next-80B-A3B-Instruct-quantized.w4a16
Text Generation
• 12B • Updated • 316
• 3
RedHatAI/Qwen3-30B-A3B-Instruct-2507-speculator.eagle3
Text Generation
• 0.5B • Updated • 812
• 2