view article Article The NLP Course is becoming the LLM Course +8 burtenshaw, reach-vb, lewtun, fdaudens, pcuenq, tomaarsen, coyotte508, mishig, sergiopaniego, julien-c • Apr 3, 2025 • 106
Llama 3.1 GPTQ, AWQ, and BNB Quants Collection Optimised Quants for high-throughput deployments! Compatible with Transformers, TGI & VLLM 🤗 • 9 items • Updated Sep 26, 2024 • 57
FP8 LLMs for vLLM Collection Accurate FP8 quantized models by Neural Magic, ready for use with vLLM! • 42 items • Updated Mar 2 • 80
INT8 LLMs for vLLM Collection Accurate INT8 quantized models by Neural Magic, ready for use with vLLM! • 47 items • Updated Mar 2 • 20