RedHatAI/Meta-Llama-3.1-8B-Instruct-quantized.w8a8 Text Generation • 8B • Updated Sep 22, 2025 • 17k • 20
INT8 LLMs for vLLM Collection • Accurate INT8 quantized models by Neural Magic, ready for use with vLLM! • 47 items • Updated 11 days ago • 19
TheBloke/Mistral-7B-Instruct-v0.2-GGUF Text Generation • 7B • Updated Dec 11, 2023 • 89.4k • 501