Improve model card: Add pipeline tag, library name, and correct GitHub link
#1
by nielsr (HF Staff) - opened

README.md CHANGED
@@ -1,12 +1,14 @@
---
license: apache-2.0
---
<h1 align="center">Metis-HOME: Hybrid Optimized Mixture-of-Experts for Multimodal Reasoning</h1>
<h5 align="center">
[Paper](https://arxiv.org/pdf/2510.20519) <a href='https://huggingface.co/mmthinking/Metis-HOME'><img src='https://img.shields.io/badge/%F0%9F%A4%97%20Hugging%20Face%20-models-blue'></a> [GitHub](https://github.com/MM-Thinking/Metis-HOME)
</h5>
## 💡 Overview
Current multimodal reasoning models face a critical dilemma: they often "overthink" on simple tasks (inefficiency) and suffer from general capability degradation when optimized for reasoning.
We introduce **Metis-HOME** (**H**ybrid **O**ptimized **M**ixture-of-**E**xperts), a framework that enables a "Hybrid Thinking" paradigm. It restructures the original dense model (Qwen2.5-VL-7B) into two distinct expert branches: a Thinking Expert for complex reasoning and a Non-Thinking Expert for rapid inference, with a lightweight router selecting between them. This design resolves the reasoning-vs-generalization trade-off.
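The two-branch routing idea above can be sketched in a few lines. This is an illustrative toy, not the released Metis-HOME implementation: `HybridRouter`, its dimensions, and the pooled-feature input are all assumptions for demonstration.

```python
import torch
import torch.nn as nn


class HybridRouter(nn.Module):
    """Toy sketch of a lightweight router choosing between two expert
    branches (0 = non-thinking, 1 = thinking). Hypothetical code, not
    the released Metis-HOME implementation."""

    def __init__(self, hidden_dim: int):
        super().__init__()
        # A single linear layer scores the two branches per example.
        self.gate = nn.Linear(hidden_dim, 2)

    def forward(self, pooled_features: torch.Tensor) -> torch.Tensor:
        # Hard routing: pick the higher-scoring branch for each input.
        return self.gate(pooled_features).argmax(dim=-1)


# Hypothetical usage: route a batch of 4 pooled multimodal features.
router = HybridRouter(hidden_dim=16)
choices = router(torch.randn(4, 16))  # shape (4,), entries in {0, 1}
```

In a full model, each routing decision would dispatch the input either to the reasoning (chain-of-thought) branch or to the fast direct-answer branch.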
<div style="display: flex; justify-content: center; gap: 20px; flex-wrap: wrap;">
<img src="https://raw.githubusercontent.com/MM-Thinking/Metis-HOME/main/assets/framework.png" alt="Metis-HOME Framework Overview" style="width:400px; max-width:100%;">
</div>
### Thinking Ratio
As shown in the following figure, the **thinking ratio** analysis of Metis-HOME reveals adaptive routing behavior:
- **High ratios (78%β98%)** on reasoning-heavy benchmarks (*WeMath*, *MathVision*, etc.), indicating effective use of the *thinking expert* for multi-step inference.
- **Low ratios (2%β5%)** on general benchmarks (*MMBench*, *OCRBench*), showing preference for the *non-thinking expert*.
This aligns with our design goal: **deliberate reasoning for complex tasks** and **fast inference for simple ones**, improving computational efficiency.
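For concreteness, a benchmark's thinking ratio is simply the fraction of prompts the router sends to the thinking expert. The helper below is a hypothetical illustration, not part of the released codebase:

```python
def thinking_ratio(route_decisions: list[int]) -> float:
    """Fraction of samples routed to the thinking expert (decision == 1).
    Illustrative helper; not part of the released Metis-HOME code."""
    if not route_decisions:
        return 0.0
    return sum(1 for d in route_decisions if d == 1) / len(route_decisions)


# A reasoning-heavy benchmark mostly triggers the thinking expert:
print(thinking_ratio([1, 1, 1, 0, 1]))  # 0.8
```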