Add pipeline tag and library name
#2
by
nielsr
HF Staff
- opened
README.md
CHANGED
|
@@ -1,7 +1,9 @@
|
|
| 1 |
---
|
| 2 |
-
license: mit
|
| 3 |
base_model:
|
| 4 |
- inclusionAI/Ling-lite
|
|
|
|
|
|
|
|
|
|
| 5 |
---
|
| 6 |
|
| 7 |
# Ming-Lite-Omni
|
|
@@ -115,7 +117,7 @@ Note: All models are evaluated based on 128 uniformly sampled frames.
|
|
| 115 |
<div align="center">
|
| 116 |
|
| 117 |
| Model | aishell1 | aishell2_android | aishell2_ios | cv15_zh | fleurs_zh | wenetspeech_meeting | wenetspeech_net | librispeech_test_clean | librispeech_test_other | multilingual_librispeech | cv15_en | fleurs_en | voxpopuli_v1.0_en |
|
| 118 |
-
|
| 119 |
| Ming-lite-omni | 1.47 | **2.55** | **2.52** | 6.31 | 2.96 | 5.95 | 5.46 | 1.44 | 2.80 | **4.15** | **6.89** | **3.39** | **5.80** |
|
| 120 |
| Qwen2.-Omni | 1.18 | 2.75 | 2.63 | **5.20** | 3.00 | **5.90** | 7.70 | 1.80 | 3.40 | 7.56 | 7.60 | 4.10 | **5.80** |
|
| 121 |
| Qwen2-Audio | 1.53 | 2.92 | 2.92 | 6.90 | 7.50 | 7.16 | 8.42 | 1.60 | 3.60 | 5.40 | 8.60 | 6.90 | 6.84 |
|
|
@@ -200,8 +202,6 @@ If you're in mainland China, we strongly recommend you to download our model fro
|
|
| 200 |
Additional demonstration cases are available on our project [page](https://lucaria-academy.github.io/Ming-Omni/).
|
| 201 |
|
| 202 |
|
| 203 |
-
|
| 204 |
-
|
| 205 |
## Example Usage
|
| 206 |
|
| 207 |
Please download our model following [Model Downloads](#model-downloads), then you can refer to the following codes to run Ming-lite-omni model.
|
|
@@ -275,19 +275,31 @@ messages = [
|
|
| 275 |
To enable thinking before response, adding the following system prompt before your question:
|
| 276 |
|
| 277 |
```python
|
| 278 |
-
cot_prompt = "SYSTEM: You are a helpful assistant. When the user asks a question, your response must include two parts: first, the reasoning process enclosed in <thinking>...</thinking> tags, then the final answer enclosed in <answer>...</answer> tags. The critical answer or key result should be placed within \\boxed{}
|
|
|
|
| 279 |
# And your input message should be like this:
|
| 280 |
messages = [
|
| 281 |
{
|
| 282 |
"role": "HUMAN",
|
| 283 |
"content": [
|
| 284 |
{"type": "image", "image": os.path.join(assets_path, "reasoning.png")},
|
| 285 |
-
{"type": "text", "text": cot_prompt + "In the rectangle $A B C D$ pictured, $M_{1}$ is the midpoint of $D C, M_{2}$ the midpoint of $A M_{1}, M_{3}$ the midpoint of $B M_{2}$ and $M_{4}$ the midpoint of $C M_{3}$. Determine the ratio of the area of the quadrilateral $M_{1} M_{2} M_{3} M_{4}$ to the area of the rectangle $A B C D
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 286 |
],
|
| 287 |
},
|
| 288 |
]
|
| 289 |
# Output:
|
| 290 |
-
# \<think
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 291 |
```
|
| 292 |
|
| 293 |
```python
|
|
@@ -547,5 +559,4 @@ If you find our work helpful, feel free to give us a cite.
|
|
| 547 |
archivePrefix = {arXiv},
|
| 548 |
url = {https://arxiv.org/abs/2506.09344}
|
| 549 |
}
|
| 550 |
-
```
|
| 551 |
-
|
|
|
|
| 1 |
---
|
|
|
|
| 2 |
base_model:
|
| 3 |
- inclusionAI/Ling-lite
|
| 4 |
+
license: mit
|
| 5 |
+
pipeline_tag: any-to-any
|
| 6 |
+
library_name: transformers
|
| 7 |
---
|
| 8 |
|
| 9 |
# Ming-Lite-Omni
|
|
|
|
| 117 |
<div align="center">
|
| 118 |
|
| 119 |
| Model | aishell1 | aishell2_android | aishell2_ios | cv15_zh | fleurs_zh | wenetspeech_meeting | wenetspeech_net | librispeech_test_clean | librispeech_test_other | multilingual_librispeech | cv15_en | fleurs_en | voxpopuli_v1.0_en |
|
| 120 |
+
|:--------------:|:--------:|:----------------:|:------------:|:--------:|:---------:|:-------------------:|:---------------:|:----------------------:|:----------------------:|:------------------------:|:--------:|:---------:|:--------------------:|\
|
| 121 |
| Ming-lite-omni | 1.47 | **2.55** | **2.52** | 6.31 | 2.96 | 5.95 | 5.46 | 1.44 | 2.80 | **4.15** | **6.89** | **3.39** | **5.80** |
|
| 122 |
| Qwen2.-Omni | 1.18 | 2.75 | 2.63 | **5.20** | 3.00 | **5.90** | 7.70 | 1.80 | 3.40 | 7.56 | 7.60 | 4.10 | **5.80** |
|
| 123 |
| Qwen2-Audio | 1.53 | 2.92 | 2.92 | 6.90 | 7.50 | 7.16 | 8.42 | 1.60 | 3.60 | 5.40 | 8.60 | 6.90 | 6.84 |
|
|
|
|
| 202 |
Additional demonstration cases are available on our project [page](https://lucaria-academy.github.io/Ming-Omni/).
|
| 203 |
|
| 204 |
|
|
|
|
|
|
|
| 205 |
## Example Usage
|
| 206 |
|
| 207 |
Please download our model following [Model Downloads](#model-downloads), then you can refer to the following codes to run Ming-lite-omni model.
|
|
|
|
| 275 |
To enable thinking before response, adding the following system prompt before your question:
|
| 276 |
|
| 277 |
```python
|
| 278 |
+
cot_prompt = "SYSTEM: You are a helpful assistant. When the user asks a question, your response must include two parts: first, the reasoning process enclosed in <thinking>...</thinking> tags, then the final answer enclosed in <answer>...</answer> tags. The critical answer or key result should be placed within \\boxed{}.
|
| 279 |
+
"
|
| 280 |
# And your input message should be like this:
|
| 281 |
messages = [
|
| 282 |
{
|
| 283 |
"role": "HUMAN",
|
| 284 |
"content": [
|
| 285 |
{"type": "image", "image": os.path.join(assets_path, "reasoning.png")},
|
| 286 |
+
{"type": "text", "text": cot_prompt + "In the rectangle $A B C D$ pictured, $M_{1}$ is the midpoint of $D C, M_{2}$ the midpoint of $A M_{1}, M_{3}$ the midpoint of $B M_{2}$ and $M_{4}$ the midpoint of $C M_{3}$. Determine the ratio of the area of the quadrilateral $M_{1} M_{2} M_{3} M_{4}$ to the area of the rectangle $A B C D$.
|
| 287 |
+
Choices:
|
| 288 |
+
(A) $\\frac{7}{16}$
|
| 289 |
+
(B) $\\frac{3}{16}$
|
| 290 |
+
(C) $\\frac{7}{32}$
|
| 291 |
+
(D) $\\frac{9}{32}$
|
| 292 |
+
(E) $\\frac{1}{5}$"},
|
| 293 |
],
|
| 294 |
},
|
| 295 |
]
|
| 296 |
# Output:
|
| 297 |
+
# \<think\>
|
| 298 |
+
Okay, so I have this problem about a rectangle ABCD ... (thinking process omitted) ... So, the correct answer is C.
|
| 299 |
+
\</think\>
|
| 300 |
+
\<answer\>\\boxed{C}\</answer\>
|
| 301 |
+
|
| 302 |
+
|
| 303 |
```
|
| 304 |
|
| 305 |
```python
|
|
|
|
| 559 |
archivePrefix = {arXiv},
|
| 560 |
url = {https://arxiv.org/abs/2506.09344}
|
| 561 |
}
|
| 562 |
+
```
|
|
|