File size: 978 Bytes
a86c03e
 
 
 
 
 
 
 
 
d646fd2
a675c60
d646fd2
 
 
 
 
 
 
 
bf86f74
ae82d28
d646fd2
7274583
c074b05
bf86f74
a675c60
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
---
title: README
emoji: 🌍
colorFrom: pink
colorTo: green
sdk: static
pinned: false
---

# VLM2Vec & MMEB: Benchmarking multimodal embeddings and adapting state-of-the-art multimodal large language models into embedding models.

- **Website** - https://tiger-ai-lab.github.io/VLM2Vec/

- **Github** https://github.com/TIGER-AI-Lab/VLM2Vec


## List of Our Papers

### Main VLM2Vec / MMEB Series
- **[VLM2Vec / MMEB](https://arxiv.org/pdf/2410.05160)** – Image embedding benchmarking and models.  (ICLR2025)
- **[VLM2Vec-V2 / MMEB-V2](https://arxiv.org/pdf/2507.04590)** – Extension of our previous work to video and visual document tasks.  (TMLR2026) 

### Other Related Papers from Our Team
- **[GAE-Retriever](https://arxiv.org/pdf/2506.22056)** – Benchmark and model for trajectory modeling in GUI environments.  (Computer-use Agents@ICML 2025)
- **[B3](https://arxiv.org/pdf/2505.11293)** – A novel batch mining strategy for contrastive learning.  (Neurips2025)