zakoman/TinyLlama-1.1B-intermediate-step-1195k-token-2.5T-exl2 Text Generation • Updated Dec 12, 2023 • 5
SparQ Attention: Bandwidth-Efficient LLM Inference Paper • 2312.04985 • Published Dec 8, 2023 • 40
LCM-LoRA: A Universal Stable-Diffusion Acceleration Module Paper • 2311.05556 • Published Nov 9, 2023 • 86