MHA2MLA-VLM - a cnxup Collection

cnxup 's Collections

updated Jan 24

The MHA2MLA-VLM model published in the paper "MHA2MLA-VLM: Enabling DeepSeek's Economical Multi-Head Latent Attention across Vision-Language Models"