Inference Providers
Active filters: redhat
BCCard/Qwen3-32B-FP8-Dynamic
Text Generation
• 33B • Updated • 5
• 1
BCCard/Qwen3-30B-A3B-FP8-Dynamic
Text Generation
• 31B • Updated • 26k
Text Generation
• 15B • Updated • 79
• 1
RedHatAI/Llama-4-Maverick-17B-128E-Instruct-FP8
Image-Text-to-Text
• 402B • Updated • 180
• 2
RedHatTraining/AI296-m3diterraneo-hotels
8B • Updated • 47
• 1
RedHatAI/DeepSeek-R1-0528-quantized.w4a16
Text Generation
• 104B • Updated • 767
• 13
RedHatAI/Llama-4-Maverick-17B-128E-Instruct-quantized.w4a16
Image-Text-to-Text
• 59B • Updated • 369
• 1
Image-Text-to-Text
• 109B • Updated • 3
RedHatAI/Kimi-K2-Instruct-quantized.w4a16
Text Generation
• 1T • Updated • 581
• 12
nm-testing/Llama-3.1-8B-Instruct-speculator.eagle3-converted
Text Generation
• 1.0B • Updated • 250
RedHatAI/SmolLM3-3B-quantized.w4a16
0.9B • Updated • 25
• 1
Text-to-Image
• Updated • 5
RedHatAI/Devstral-Small-2507-FP8-Dynamic
Text Generation
• 24B • Updated • 30
• 4
RedHatAI/Devstral-Small-2507-quantized.w8a8
Text Generation
• 24B • Updated • 95
• 1
RedHatAI/Devstral-Small-2507-quantized.w4a16
Text Generation
• 4B • Updated • 24
• 2
RedHatAI/Qwen3-14B-speculator.eagle3
Text Generation
• 1B • Updated • 6.04k
RedHatAI/Qwen3-32B-speculator.eagle3
Text Generation
• 2B • Updated • 885
• 8
RedHatAI/Llama-3.3-70B-Instruct-speculator.eagle3
Text Generation
• 2B • Updated • 3.12k
• 1
RedHatAI/Llama-3.1-8B-Instruct-speculator.eagle3
Text Generation
• 1.0B • Updated • 24.1k
• 2
RedHatAI/Qwen3-8B-speculator.eagle3
Text Generation
• 1B • Updated • 71.6k
• 28
RedHatAI/gpt-oss-20b-speculator.eagle3
Text Generation
• 0.9B • Updated • 18.9k
• 8
RedHatAI/Qwen3-235B-A22B-Instruct-2507-speculator.eagle3
Text Generation
• 1B • Updated • 812
ChibuUkachi/Qwen3-4B-Instruct-2507.w4a16
Text Generation
• 1B • Updated • 5
RedHatAI/Qwen3-4B-Thinking-2507-quantized.w4a16
Text Generation
• 4B • Updated • 250
RedHatAI/Qwen3-4B-Instruct-2507-quantized.w4a16
Text Generation
• 4B • Updated • 160
RedHatAI/Qwen3-30B-A3B-Thinking-2507-quantized.w4a16
Text Generation
• 5B • Updated • 84
RedHatAI/Qwen3-30B-A3B-Instruct-2507-quantized.w4a16
Text Generation
• 5B • Updated • 1.47k
• 1
RedHatAI/Qwen3-Next-80B-A3B-Instruct-quantized.w4a16
Text Generation
• 12B • Updated • 321
• 3
RedHatAI/Qwen3-30B-A3B-Instruct-2507-speculator.eagle3
Text Generation
• 0.5B • Updated • 783
• 2
RedHatAI/Qwen3-Next-80B-A3B-Thinking-quantized.w4a16
Text Generation
• Updated • 34