| license: apache-2.0 | |
| library_name: transformers | |
| pipeline_tag: image-text-to-text | |
| This repository contains the NativeRes-LLaVA model described in the paper [Native Visual Understanding: Resolving Resolution Dilemmas in Vision-Language Models](https://huggingface.co/papers/2506.12776). | |
| Code: https://github.com/Niujunbo2002/NativeRes-LLaVA |