Update README.md
Browse files
README.md
CHANGED
|
@@ -44,8 +44,8 @@ Unlike general-purpose multimodal models, LandAI-Base has been fine-tuned on a c
|
|
| 44 |
|
| 45 |
LandAI-Base demonstrates a substantial leap in reasoning capabilities compared to its backbone model. In the **GeoTest2025** benchmark (derived from restricted 2025 National Postgraduate Entrance Examination questions), it achieves near-commercial performance.
|
| 46 |
|
| 47 |
-
| Model | GeoTest2025 (Geography) | AIME 2024
|
| 48 |
-
| :--- | :---: | :---: | :---: |
|
| 49 |
| **LandAI-Base-7B (Ours)** | **93.3%** | **16.7%** | **66.4%** | **44.7%** |
|
| 50 |
| Qwen2.5-VL-7B (Baseline) | 46.7% | 3.3% | 67.3% | 41.2% |
|
| 51 |
| GPT-4o | 92.1% | 9.3% | 90.2% | 51.9% |
|
|
|
|
| 44 |
|
| 45 |
LandAI-Base demonstrates a substantial leap in reasoning capabilities compared to its backbone model. In the **GeoTest2025** benchmark (derived from restricted 2025 National Postgraduate Entrance Examination questions), it achieves near-commercial performance.
|
| 46 |
|
| 47 |
+
| Model | GeoTest2025 (Geography) | AIME 2024 | HumanEval | MMMU pro |
|
| 48 |
+
| :--- | :---: | :---: | :---: | :---: |
|
| 49 |
| **LandAI-Base-7B (Ours)** | **93.3%** | **16.7%** | **66.4%** | **44.7%** |
|
| 50 |
| Qwen2.5-VL-7B (Baseline) | 46.7% | 3.3% | 67.3% | 41.2% |
|
| 51 |
| GPT-4o | 92.1% | 9.3% | 90.2% | 51.9% |
|