RedHatAI/DeepSeek-V2.5-1210-quantized.w4a16
Text Generation
• 32B • Updated • 6
RedHatAI/DeepSeek-V2.5-1210-FP8
Text Generation
• 236B • Updated • 60.8k
• 4
RedHatAI/DeepSeek-Coder-V2-Instruct-0724-FP8
Text Generation
• 236B • Updated • 9
• 1
RedHatAI/QwQ-32B-Preview-quantized.w8a8
Text Generation
• 33B • Updated
RedHatAI/QwQ-32B-Preview-FP8-dynamic
Text Generation
• 33B • Updated
RedHatAI/QwQ-32B-Preview-quantized.w4a16
6B • Updated • 7
RedHatAI/Llama-3.1-Nemotron-70B-Instruct-HF-quantized.w8a8
Text Generation
• 71B • Updated
RedHatAI/Llama-3.1-Nemotron-70B-Instruct-HF-quantized.w4a16
Text Generation
• 11B • Updated • 3
RedHatAI/Mixtral-8x22B-v0.1-quantized.w4a16
18B • Updated • 3
RedHatAI/Sparse-Llama-3.1-8B-ultrachat_200k-2of4-FP8-dynamic
Text Generation
• 8B • Updated • 1
RedHatAI/Sparse-Llama-3.1-8B-evolcodealpaca-2of4-FP8-dynamic
Text Generation
• 8B • Updated • 12
RedHatAI/Sparse-Llama-3.1-8B-gsm8k-2of4-FP8-dynamic
Text Generation
• 8B • Updated • 5
• 2
RedHatAI/Sparse-Llama-3.1-8B-gsm8k-2of4-quantized.w4a16
Text Generation
• 2B • Updated • 3
RedHatAI/Sparse-Llama-3.1-8B-ultrachat_200k-2of4-quantized.w4a16
Text Generation
• 2B • Updated • 4
• 3
RedHatAI/Sparse-Llama-3.1-8B-evolcodealpaca-2of4-quantized.w4a16
Text Generation
• 2B • Updated • 7
RedHatAI/Qwen2.5-3B-quantized.w4a16
Text Generation
• 1.0B • Updated • 94
RedHatAI/Qwen2.5-1.5B-quantized.w4a16
Text Generation
• 0.6B • Updated • 52
RedHatAI/Qwen2.5-0.5B-quantized.w4a16
Text Generation
• 0.3B • Updated • 1.2k
RedHatAI/Qwen2.5-14B-Instruct-quantized.w8a8
Text Generation
• 15B • Updated • 167
RedHatAI/granite-3.1-8b-instruct-GGUF
8B • Updated • 6
RedHatAI/Sparse-Llama-3.1-8B-2of4
Text Generation
• 8B • Updated • 41
• 62
RedHatAI/Qwen2.5-Math-7B-Instruct-FP8-dynamic
8B • Updated
RedHatAI/Qwen2.5-0.5B-Instruct-quantized.w8a8
Text Generation
• 0.6B • Updated • 208
RedHatAI/Qwen2.5-72B-FP8-dynamic
Text Generation
• 73B • Updated • 15
• 1
RedHatAI/Qwen2.5-72B-quantized.w8a8
Text Generation
• 73B • Updated • 26
RedHatAI/Qwen2.5-14B-quantized.w8a8
Text Generation
• 15B • Updated • 2
• 2
RedHatAI/Qwen2.5-14B-FP8-dynamic
Text Generation
• 15B • Updated • 178
• 2
RedHatAI/Qwen2.5-7B-quantized.w8a8
Text Generation
• 8B • Updated • 40
• 1
RedHatAI/Qwen2.5-3B-FP8-dynamic
Text Generation
• 3B • Updated • 62
RedHatAI/Qwen2.5-1.5B-FP8-dynamic
Text Generation
• 2B • Updated • 1.15k