CUDAOUTOFMEMORY/PLUME-Qwen2-VL-2B



Add metadata and improve model card

#1

by nielsr HF Staff - opened 10 days ago

base: refs/heads/main ← from: refs/pr/1


README.md +21 -6


@@ -1,19 +1,21 @@
 ---
 datasets:
 - VLM2Vec/MMEB-V2
 language:
 - en
-base_model:
-- Qwen/Qwen2-VL-2B-Instruct
 ---
 # PLUME-Qwen2-VL-2B
 **PLUME: Latent Reasoning Based Universal Multimodal Embedding**
-PLUME is a latent reasoning framework for universal multimodal embedding (UME). It replaces explicit chain-of-thought (CoT) generation with a short autoregressive rollout of continuous latent states, achieving stronger retrieval
-performance while delivering **over 30x faster inference** compared to explicit-CoT methods.
-**[Project Page](https://haoxiangzhao12138.github.io/PLUME/)** | **[Paper](https://arxiv.org/abs/2507.00001)** | **[Code](https://github.com/haoxiangzhao12138/PLUME)**
 ## Highlights
@@ -43,5 +45,18 @@ huggingface-cli download CUDAOUTOFMEMORY/PLUME-Qwen2-VL-2B --local-dir /path/to/
 # Option 2: git clone (requires git-lfs)
 git lfs install
 git clone https://huggingface.co/CUDAOUTOFMEMORY/PLUME-Qwen2-VL-2B
 ```

 ---
+base_model:
+- Qwen/Qwen2-VL-2B-Instruct
 datasets:
 - VLM2Vec/MMEB-V2
 language:
 - en
+library_name: transformers
+pipeline_tag: feature-extraction
 ---
 # PLUME-Qwen2-VL-2B
 **PLUME: Latent Reasoning Based Universal Multimodal Embedding**
+PLUME is a latent reasoning framework for universal multimodal embedding (UME). It replaces explicit chain-of-thought (CoT) generation with a short autoregressive rollout of continuous latent states, achieving stronger retrieval performance while delivering **over 30x faster inference** compared to explicit-CoT methods.
+**[Project Page](https://haoxiangzhao12138.github.io/PLUME/)** | **[Paper](https://arxiv.org/abs/2604.02073)** | **[Code](https://github.com/haoxiangzhao12138/PLUME)**
 ## Highlights
 # Option 2: git clone (requires git-lfs)
 git lfs install
 git clone https://huggingface.co/CUDAOUTOFMEMORY/PLUME-Qwen2-VL-2B
+```
+## Citation
+```bibtex
+@misc{he2026plumelatentreasoningbased,
+      title={PLUME: Latent Reasoning Based Universal Multimodal Embedding},
+      author={Chenwei He and Xiangzhao Hao and Tianyu Yang and Yuxiang Ma and Yuheng Jia and Lingxiang Wu and Chaoyang Zhao and Haiyun Guo and Jinqiao Wang},
+      year={2026},
+      eprint={2604.02073},
+      archivePrefix={arXiv},
+      primaryClass={cs.CV},
+      url={https://arxiv.org/abs/2604.02073},
+}
 ```
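
The metadata block this PR adds can be sanity-checked before merging. The following is a minimal sketch, not part of the PR itself: `parse_front_matter` and `REQUIRED_KEYS` are hypothetical names introduced here, and the parser handles only the flat scalars and simple lists that appear in this card, not general YAML.

```python
# Hypothetical helper: checks that a model card's front matter carries the
# keys touched by this PR. Handles only "key: value" scalars and "- item"
# lists, which is all this README uses; it is not a general YAML parser.

REQUIRED_KEYS = ("base_model", "datasets", "language", "library_name", "pipeline_tag")

# Front matter as it appears on the new side of the diff.
CARD = """---
base_model:
- Qwen/Qwen2-VL-2B-Instruct
datasets:
- VLM2Vec/MMEB-V2
language:
- en
library_name: transformers
pipeline_tag: feature-extraction
---
# PLUME-Qwen2-VL-2B
"""


def parse_front_matter(text):
    """Return the front matter between the leading '---' markers as a dict."""
    lines = text.splitlines()
    if not lines or lines[0].strip() != "---":
        return {}
    meta = {}
    key = None
    for line in lines[1:]:
        stripped = line.strip()
        if stripped == "---":
            break  # closing marker: front matter ends here
        if stripped.startswith("- "):
            # List item: attach to the most recent key if it holds a list.
            if key is not None and isinstance(meta.get(key), list):
                meta[key].append(stripped[2:].strip())
        elif ":" in stripped:
            key, _, value = stripped.partition(":")
            key = key.strip()
            value = value.strip()
            # An empty value means a list follows on the next lines.
            meta[key] = value if value else []
    return meta


meta = parse_front_matter(CARD)
missing = [k for k in REQUIRED_KEYS if k not in meta]
print("missing keys:", missing)
print("pipeline_tag:", meta.get("pipeline_tag"))
```

Running the same check against the old front matter (the removed side of the diff) would report `library_name` and `pipeline_tag` as missing, which are the two keys this PR introduces.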