Token Merging for fast LLM inference: Background and first trials with Mistral
samchain • Apr 30, 2024