| | --- |
| | license: gpl-3.0 |
| | datasets: |
| | - JosephusCheung/GuanacoVQADataset |
| | language: |
| | - en |
| | - zh |
| | - ja |
| | - de |
| | pipeline_tag: visual-question-answering |
| | --- |
| | |
| | The following content is currently a work in progress and does not represent the final quality. |
| |
|
| | Alignment for the multilingual VQA tasks is being conducted on blip2-flan-t5-xxl and Guanaco using only Linear Layers. |
| |
|
| | The latest weight file is provided here, based on the implementation of MiniGPT-4. |
| |
|
| | This model supports English, Chinese, Japanese, and German languages and requires the combined use of the Guanaco 7B LLM model. |
| |
|
| | A portion of the dataset has already been released. |