view article Article Optimum-NVIDIA Unlocking blazingly fast LLM inference in just 1 line of code Dec 5, 2023 • 5