Active filters: fp8
comaniac/Mixtral-8x7B-Instruct-v0.1-FP8-v1
Text Generation
• 47B • Updated
• 1
comaniac/Mixtral-8x7B-Instruct-v0.1-FP8-v2
Text Generation
• 47B • Updated
• 5
Skywork/Skywork-MoE-Base-FP8
Text Generation
• 146B • Updated
• 12
• 7
RedHatAI/Qwen2-72B-Instruct-FP8
Text Generation
• 73B • Updated
• 878
• 15
comaniac/Meta-Llama-3-70B-Instruct-FP8-v2
Text Generation
• 71B • Updated
• 2
comaniac/Mixtral-8x7B-Instruct-v0.1-FP8-v3
Text Generation
• 47B • Updated
• 1
comaniac/Mixtral-8x22B-Instruct-v0.1-FP8-v2
Text Generation
• 141B • Updated
• 20
RedHatAI/Mixtral-8x22B-Instruct-v0.1-AutoFP8
Text Generation
• 141B • Updated
• 8
• 3
Text Generation
• 8B • Updated
• 2
RedHatAI/Qwen2-0.5B-Instruct-FP8
Text Generation
• 0.5B • Updated
• 263
• 3
RedHatAI/Qwen2-1.5B-Instruct-FP8
Text Generation
• 2B • Updated
• 18.6k
RedHatAI/Qwen2-7B-Instruct-FP8
Text Generation
• 8B • Updated
• 3.06k
• • 2
anyisalin/L3-70B-Euryale-v2.1-FP8
Text Generation
• 71B • Updated
• 2
yentinglin/Llama-3-Taiwan-70B-Instruct-FP8
Text Generation
• 71B • Updated
• 17
kuotient/llama3-instrucTrans-enko-8b-FP8
Text Generation
• 8B • Updated
• 4
• 2
FlorianJc/Hermes-2-Pro-Mistral-7B-vllm-fp8
Text Generation
• 7B • Updated
• 6
FlorianJc/openchat-3.6-8b-20240522-vllm-fp8
Text Generation
• 8B • Updated
• 3
FlorianJc/Llama3-ChatQA-1.5-8B-vllm-fp8
Text Generation
• 8B • Updated
• 1
TechxGenus/Codestral-22B-v0.1-FP8
Text Generation
• 22B • Updated
• 57
Model-SafeTensors/Meta-Llama-3-70B-FP8-Dynamic
Text Generation
• 71B • Updated
• 2
Model-SafeTensors/Qwen-Qwen2-72B-FP8-Dynamic
Text Generation
• 73B • Updated
• 8
RedHatAI/Meta-Llama-3-70B-Instruct-FP8-KV
Text Generation
• 71B • Updated
• 4
• 3
RedHatAI/Mistral-7B-Instruct-v0.3-FP8
Text Generation
• 7B • Updated
• 2.36k
• 3
RedHatAI/Llama-2-7b-chat-hf-FP8
Text Generation
• 7B • Updated
• 322
RedHatAI/Phi-3-mini-128k-instruct-FP8
Text Generation
• 4B • Updated
• 16
RedHatAI/Phi-3-medium-128k-instruct-FP8
Text Generation
• 14B • Updated
• 131
• 5
Rallio67/llama-3-70B-actions-FP8
Text Generation
• 71B • Updated
• 2
nerdylive/Meta-Llama-3-8B-Instruct-FP8
Text Generation
• 8B • Updated
• 1
FlorianJc/google-gemma-2-9b-it-vllm-fp8
Text Generation
• 9B • Updated
• 11
• 1
tranhoangnguyen03/Gemma-2-9B-It-SPPO-Iter3_Q8
Text Generation
• 9B • Updated
• 1