RedHatAI/starcoder2-3b-quantized.w8a16
Text Generation
•
1B
•
Updated
•
1
RedHatAI/Meta-Llama-3.1-70B-quantized.w8a8
Text Generation
•
71B
•
Updated
RedHatAI/Meta-Llama-3.1-405B-FP8
Text Generation
•
410B
•
Updated
•
9
RedHatAI/Meta-Llama-3.1-70B-quantized.w8a16
Text Generation
•
19B
•
Updated
RedHatAI/starcoder2-3b-FP8
Text Generation
•
3B
•
Updated
•
3
RedHatAI/starcoder2-7b-FP8
Text Generation
•
7B
•
Updated
•
10
RedHatAI/starcoder2-15b-FP8
Text Generation
•
16B
•
Updated
•
2
RedHatAI/Mistral-Nemo-Instruct-2407-quantized.w8a16
Text Generation
•
4B
•
Updated
•
333
RedHatAI/Meta-Llama-3.1-8B-quantized.w8a16
Text Generation
•
3B
•
Updated
•
1
•
1
RedHatAI/Meta-Llama-3.1-70B-FP8
Text Generation
•
71B
•
Updated
•
706
•
2
RedHatAI/Mistral-Large-Instruct-2407-FP8
Text Generation
•
123B
•
Updated
•
4.72k
RedHatAI/Meta-Llama-3.1-70B-Instruct-quantized.w8a16
Text Generation
•
19B
•
Updated
•
26
•
5
RedHatAI/Meta-Llama-3.1-8B-Instruct-FP8
Text Generation
•
8B
•
Updated
•
546k
•
44
RedHatAI/Mistral-7B-Instruct-v0.3-quantized.w8a8
Text Generation
•
7B
•
Updated
•
5
•
2
RedHatAI/Qwen2-72B-Instruct-quantized.w8a8
Text Generation
•
73B
•
Updated
•
6
•
2
RedHatAI/Meta-Llama-3-70B-Instruct-quantized.w8a8
Text Generation
•
71B
•
Updated
•
13
RedHatAI/Qwen2-7B-Instruct-quantized.w8a8
Text Generation
•
8B
•
Updated
•
20
RedHatAI/Phi-3-medium-128k-instruct-quantized.w4a16
Text Generation
•
2B
•
Updated
•
1.11k
•
3
RedHatAI/Qwen2-0.5B-Instruct-quantized.w8a8
Text Generation
•
0.6B
•
Updated
•
120
RedHatAI/Phi-3-mini-128k-instruct-quantized.w4a16
Text Generation
•
0.7B
•
Updated
•
40
•
1
RedHatAI/Qwen2-1.5B-Instruct-quantized.w8a8
Text Generation
•
2B
•
Updated
•
233
RedHatAI/Meta-Llama-3-8B-Instruct-quantized.w8a8
Text Generation
•
8B
•
Updated
•
1.21k
•
2
RedHatAI/Llama-2-7b-chat-quantized.w8a8
Text Generation
•
7B
•
Updated
•
79
•
1
RedHatAI/Phi-3-mini-128k-instruct-quantized.w8a16
Text Generation
•
1B
•
Updated
•
9
RedHatAI/Phi-3-mini-128k-instruct-FP8
Text Generation
•
4B
•
Updated
•
21
RedHatAI/Llama-3.2-3B-Instruct-FP8-dynamic
Text Generation
•
4B
•
Updated
•
92
•
3
RedHatAI/Llama-3.2-1B-Instruct-FP8-dynamic
Text Generation
•
1B
•
Updated
•
1.01M
•
3
RedHatAI/gemma-2-9b-it-quantized.w8a8
Text Generation
•
10B
•
Updated
•
48
•
2
RedHatAI/Phi-3-medium-128k-instruct-quantized.w8a8
Text Generation
•
14B
•
Updated
•
4
•
2
RedHatAI/Phi-3-medium-128k-instruct-quantized.w8a16
Text Generation
•
4B
•
Updated
•
10
•
2