Niujunbo2002's picture
Add model card (#1)
ecce88a verified
metadata
license: apache-2.0
library_name: transformers
pipeline_tag: image-text-to-text

This repository contains the NativeRes-LLaVA model described in the paper Native Visual Understanding: Resolving Resolution Dilemmas in Vision-Language Models.

Code: https://github.com/Niujunbo2002/NativeRes-LLaVA