Update README.md
Browse files
README.md
CHANGED
|
@@ -10,16 +10,18 @@ base_model:
|
|
| 10 |
This model is a fine-tuned version of microsoft/Phi-4-multimodal-instruct on the Galaxy's Last Exam Benchmark.
|
| 11 |
|
| 12 |
<p align="center">
|
| 13 |
-
<img width="
|
| 14 |
</p>
|
| 15 |
|
| 16 |
|
| 17 |
## Model description
|
| 18 |
-
Galactus is a state-of-the-art (SOTA) multimodal language model that outperforms all OpenAI and Gemini models on the Galaxy's Last Exam Benchmark.
|
| 19 |
-
|
|
|
|
| 20 |
|
| 21 |
## Intended uses & limitations
|
| 22 |
This model is intended for handling complex visual reasoning tasks that require metaphysical competence.
|
|
|
|
| 23 |
|
| 24 |
## Training and evaluation data
|
| 25 |
The model was exclusively trained on the Galaxy's Last Exam Benchmark.
|
|
|
|
| 10 |
This model is a fine-tuned version of microsoft/Phi-4-multimodal-instruct on the Galaxy's Last Exam Benchmark.
|
| 11 |
|
| 12 |
<p align="center">
|
| 13 |
+
<img width="50%" src="Main_Image.png">
|
| 14 |
</p>
|
| 15 |
|
| 16 |
|
| 17 |
## Model description
|
| 18 |
+
Galactus is a state-of-the-art (SOTA) multimodal language model that outperforms all OpenAI and Gemini models on the Galaxy's Last Exam Benchmark.
|
| 19 |
+
This benchmark features challenging tasks that push the boundaries of metaphysical competence—for instance, determining how many times two lines intersect or simulating the effect of adding three minutes to an analog clock.
|
| 20 |
+
The model accepts image input along with text prompts and has been specifically optimized to tackle the most complex visual reasoning tasks.
|
| 21 |
|
| 22 |
## Intended uses & limitations
|
| 23 |
This model is intended for handling complex visual reasoning tasks that require metaphysical competence.
|
| 24 |
+
Please do not use for normal human tasks.
|
| 25 |
|
| 26 |
## Training and evaluation data
|
| 27 |
The model was exclusively trained on the Galaxy's Last Exam Benchmark.
|