metadata
license: apache-2.0
library_name: transformers
pipeline_tag: image-text-to-text
This repository contains the NativeRes-LLaVA model described in the paper Native Visual Understanding: Resolving Resolution Dilemmas in Vision-Language Models.