Spaces:

TIGER-Lab
/

MMEB-Leaderboard

Running on CPU Upgrade

App Files Files Community

Upload GVE-7B.json

#107

by Zhuoning - opened Jan 21

base: refs/heads/main

←

from: refs/pr/107

Discussion Files changed

Jan 21

Here, we present the results of the GVE-7B.

Model: GVE-7B
Paper: Towards Universal Video Retrieval: Generalizing Video Embedding via Synthesized Multimodal Pyramid Curriculum
Evaluation: mostly from Qwen3-VL-Embedding

NOTE that:

All Public Data: GVE-7B has been trained on 13M publicly available retrieval data (including 1.55M synthesized data based on public videos), providing detailed reproduction configurations
Fully Zero-shot: No in-domain data for MMEB-V2-Video datasets is included in the training stages of the GVE series
Retrieval Data Only: We do not utilize any video QA, classification, or grounding data
Test-time Scaling: We report the best performance among the tested results from different test-time configurations of context length

Upload GVE-7B.json4084a3c5

ziyjiang changed pull request status to merged Jan 22

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment