Update README.md
Browse files
README.md
CHANGED
|
@@ -7,7 +7,9 @@ license: mit
|
|
| 7 |
|
| 8 |
## ImageDPO Finetuned Model
|
| 9 |
|
| 10 |
-
This page provides the **ImageDPO** finetuned checkpoint for LLaVA-v1.5-7B used in [Probing Visual Language Priors in VLMs](https://arxiv.org/abs/2501.00569). ImageDPO is a self-improving approach to enhance VLM visual reasoning performance by increasing reliance on visual inputs. We offer the **merged model weights** for use.
|
|
|
|
|
|
|
| 11 |
|
| 12 |
## Usage
|
| 13 |
|
|
|
|
| 7 |
|
| 8 |
## ImageDPO Finetuned Model
|
| 9 |
|
| 10 |
+
This page provides the **ImageDPO** finetuned checkpoint for LLaVA-v1.5-7B used in [Probing Visual Language Priors in VLMs](https://arxiv.org/abs/2501.00569). ImageDPO is a self-improving approach to enhance VLM visual reasoning performance by increasing reliance on visual inputs as illustrated in the below image. We offer the **merged model weights** for use.
|
| 11 |
+
|
| 12 |
+

|
| 13 |
|
| 14 |
## Usage
|
| 15 |
|