BLINK: Multimodal Large Language Models Can See but Not Perceive Paper • 2404.12390 • Published Apr 18, 2024 • 27
view article Article Quanto: a PyTorch quantization backend for Optimum +1 dacorvo, ybelkada, marcsun13 • Mar 18, 2024 • 45
DeepSparse Sparse LLMs Collection Useful LLMs for DeepSparse where we've pruned at least 50% of the weights! • 9 items • Updated Mar 2 • 5