RedHatAI/Llama-3.3-70B-Instruct-speculator.eagle3
Text Generation
•
2B
•
Updated
•
754
•
1
RedHatAI/Llama-3.3-70B-Instruct-NVFP4
Text Generation
•
41B
•
Updated
•
786
•
1
RedHatAI/Llama-3.1-70B-Instruct-NVFP4
Text Generation
•
41B
•
Updated
•
501
RedHatAI/Llama-3.1-8B-Instruct-NVFP4
Text Generation
•
5B
•
Updated
•
13.9k
Text Generation
•
19B
•
Updated
•
8.98k
•
6
Text Generation
•
9B
•
Updated
•
384
Text Generation
•
5B
•
Updated
•
601
RedHatAI/Llama-4-Scout-17B-16E-Instruct-NVFP4
Text Generation
•
64B
•
Updated
•
251
RedHatAI/Kimi-K2-Thinking-FP8-Block
1T
•
Updated
•
8
RedHatAI/Mistral-Small-3.1-24B-Instruct-2503-FP8-dynamic
Image-Text-to-Text
•
24B
•
Updated
•
11.9k
•
9
RedHatAI/Mistral-Small-3.1-24B-Instruct-2503-quantized.w8a8
Image-Text-to-Text
•
24B
•
Updated
•
655
•
5
RedHatAI/Mistral-Small-3.1-24B-Instruct-2503-quantized.w4a16
Image-Text-to-Text
•
5B
•
Updated
•
22.3k
•
10
RedHatAI/Mistral-Small-24B-Instruct-2501-FP8-dynamic
Text Generation
•
24B
•
Updated
•
881
•
13
RedHatAI/Mistral-Small-24B-Instruct-2501-quantized.w8a8
Text Generation
•
24B
•
Updated
•
13.7k
•
1
RedHatAI/Mistral-Small-24B-Instruct-2501-quantized.w4a16
Text Generation
•
4B
•
Updated
•
21
•
1
RedHatAI/Llama-3.1-8B-Instruct-FP8-block
Text Generation
•
8B
•
Updated
•
48
RedHatAI/Qwen3-VL-235B-A22B-Instruct-FP8-block
Text Generation
•
236B
•
Updated
•
83
•
3
RedHatAI/Qwen3-30B-A3B-FP8-block
Text Generation
•
31B
•
Updated
•
9.15k
RedHatAI/Llama-4-Scout-17B-16E-Instruct-FP8-block
Text Generation
•
109B
•
Updated
•
35
•
3
RedHatAI/Llama-4-Maverick-17B-128E-Instruct-FP8-block
Text Generation
•
402B
•
Updated
•
5
•
1
RedHatAI/Llama-3.3-70B-Instruct-FP8-block
Text Generation
•
71B
•
Updated
•
128
RedHatAI/Qwen3-32B-FP8-block
Text Generation
•
33B
•
Updated
•
14
RedHatAI/Qwen3-14B-FP8-block
Text Generation
•
15B
•
Updated
•
14
RedHatAI/Llama-3.1-Nemotron-70B-Instruct-HF-FP8-dynamic
Text Generation
•
71B
•
Updated
•
14k
•
14
RedHatAI/Llama-3.1-Nemotron-70B-Instruct-HF
Text Generation
•
71B
•
Updated
•
13
•
2
RedHatAI/Llama-3.2-1B-FP8
1B
•
Updated
•
27.2k
Image-Text-to-Text
•
12B
•
Updated
•
8
•
1
RedHatAI/Qwen3-VL-235B-A22B-Instruct-FP8-dynamic
Text Generation
•
236B
•
Updated
•
428
•
4
RedHatAI/Qwen2.5-VL-7B-Instruct-quantized.w8a8
Image-to-Text
•
8B
•
Updated
•
1.33k
•
8
RedHatAI/Apertus-70B-Instruct-2509-FP8-dynamic
Text Generation
•
71B
•
Updated
•
185
•
1