cnxup commited on
Commit
083df83
·
verified ·
1 Parent(s): 863e672

Upload folder using huggingface_hub

Browse files
Qwen2.5-VL-7B-rope32-d_kv_128.pt ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:c9148ff63a705b03a6f0141cfa68c2c3e15baab92c75e4505e5136ffb94f5bcb
3
+ size 513855082
Qwen2.5-VL-7B-rope32-d_kv_32.pt ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:d6c47ebb75a0e585efc6bff9a14502769eb784a858226e45f1ba3cc9e0c64417
3
+ size 128502782
Qwen2.5-VL-7B-rope32-d_kv_64.pt ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:41e1e81d12ba9f917fc01f3e571a855cf31769bf437b1ab4d0ddb7c4945e40a5
3
+ size 256953790
README.md CHANGED
@@ -1,3 +1,31 @@
1
- ---
2
- license: apache-2.0
3
- ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+
2
+
3
+ **Research Paper**: ["MHA2MLA-VLM: Enabling DeepSeek's Economical Multi-Head Latent Attention across Vision-Language Models"](https://arxiv.org/abs/2601.11464)
4
+
5
+ ## Description
6
+
7
+ This repository contains **our proposed MD-SVD (Modality-Decoupled Singular Value Decomposition) initialization weights** extracted from Stage 1 checkpoints for initializing Stage 2 MHA2MLA-VLM models, which independently compresses visual and textual KV spaces, enabling efficient compression while maintaining model performance.
8
+
9
+ ## Available Weight Files
10
+
11
+ | File Name | Latent Dimension (d_kv) |
12
+ |-----------|------------------------|
13
+ | `Qwen2.5-VL-7B-rope32-d_kv_32.pt` | 32 |
14
+ | `Qwen2.5-VL-7B-rope32-d_kv_64.pt` | 64 |
15
+ | `Qwen2.5-VL-7B-rope32-d_kv_128.pt` | 128 |
16
+
17
+
18
+
19
+ ## Citation
20
+
21
+ ```bibtex
22
+ @misc{fan2026mha2mlavlmenablingdeepseekseconomical,
23
+ title={MHA2MLA-VLM: Enabling DeepSeek's Economical Multi-Head Latent Attention across Vision-Language Models},
24
+ author={Xiaoran Fan and Zhichao Sun and Tao Ji and Lixing Shen and Tao Gui},
25
+ year={2026},
26
+ eprint={2601.11464},
27
+ archivePrefix={arXiv},
28
+ primaryClass={cs.CV},
29
+ url={https://arxiv.org/abs/2601.11464},
30
+ }
31
+ ```