view article Article Introducing multi-backends (TRT-LLM, vLLM) support for Text Generation Inference mfuntowicz, hlarcher • Jan 16, 2025 • 76
view article Article Making LLMs lighter with AutoGPTQ and transformers +4 marcsun13, fxmarty, PanEa, qwopqwop, ybelkada, TheBloke • Aug 23, 2023 • 64