TIGER-Lab/MMEB-Leaderboard


Add Btoks MMEB-V2 results

#138

by siyrus - opened 8 days ago

base: refs/heads/main ← from: refs/pr/138


+1116 -0

siyrus

8 days ago

Submit Btoks results for MMEB-V2.

Model: Btoks
Experiment: sieve_qwen3vl_2b_v1_expanded_train_v2_weight0.2_0513
Checkpoint: checkpoint-4500
Technical report: https://arxiv.org/abs/2604.11095

Add Btoks MMEB-V2 results (commit e5d1b844)

siyrus

8 days ago

We submit the MMEB-V2 results for Btoks.

Model: Btoks
Experiment: sieve_qwen3vl_2b_v1_expanded_train_v2_weight0.2_0513
Checkpoint: checkpoint-4500
Technical report / arXiv: https://arxiv.org/abs/2604.11095

Local validation with the leaderboard scripts gives:
Overall 68.29, Image-Overall 71.55, Video-Overall 49.12, Visdoc-Overall 77.77.
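
For readers checking how the four numbers relate, here is a minimal sketch. It assumes that Overall is a task-count-weighted mean of the three modality scores and that MMEB-V2 has 36 image, 18 video, and 24 visual-document tasks. Neither assumption is stated in this thread, and the function name is illustrative, not part of the leaderboard scripts.

```python
# Assumed task counts per modality for MMEB-V2 (not stated in this thread).
TASK_COUNTS = {"image": 36, "video": 18, "visdoc": 24}

# Per-modality scores reported in the comment above.
REPORTED = {"image": 71.55, "video": 49.12, "visdoc": 77.77}


def weighted_overall(scores, counts):
    """Return the mean of the modality scores, weighted by task count."""
    total_tasks = sum(counts[m] for m in scores)
    return sum(scores[m] * counts[m] for m in scores) / total_tasks


overall = weighted_overall(REPORTED, TASK_COUNTS)
print(f"{overall:.2f}")
```

Under these assumptions the result agrees with the reported Overall of 68.29 to two decimals. The inputs are themselves rounded, so this is a consistency check rather than an exact recomputation.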

Clarify Btoks base model (commit b31d7e1b)

siyrus

8 days ago

Additional model detail: Btoks is fine-tuned from Qwen/Qwen3-VL-2B-Instruct. I updated scores/Btoks.json to include this as metadata (model_backbone and base_model). The leaderboard scores are unchanged.

siyrus

8 days ago

Additional model detail, clarified: Btoks is fine-tuned from Qwen/Qwen3-VL-2B-Instruct. The submission JSON now records model_backbone = Qwen3-VL-2B-Instruct and base_model = Qwen/Qwen3-VL-2B-Instruct. The leaderboard scores are unchanged.
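
The two metadata fields can be sanity-checked against each other. The sketch below assumes only the two field names and values quoted above. The helper name and the flat dictionary layout are hypothetical, since the actual structure of scores/Btoks.json is not shown in this thread.

```python
def backbone_matches_base(metadata: dict) -> bool:
    """Check that model_backbone equals the last path component of base_model."""
    base_model = metadata.get("base_model", "")
    backbone = metadata.get("model_backbone", "")
    return bool(backbone) and base_model.split("/")[-1] == backbone


# Values as quoted in the comment above; the flat layout is an assumption.
btoks_metadata = {
    "model_backbone": "Qwen3-VL-2B-Instruct",
    "base_model": "Qwen/Qwen3-VL-2B-Instruct",
}

print(backbone_matches_base(btoks_metadata))
```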

Link Btoks checkpoint repository (commit 35ddb580)

ziyjiang changed pull request status to merged 8 days ago
