Niujunbo2002
/

qwen2vit-665m-patch14-native

Image-Text-to-Text

text-generation-inference

Model card Files Files and versions

qwen2vit-665m-patch14-native / README.md

Niujunbo2002's picture

Add model card (#1)

ecce88a verified 9 months ago

|

history blame contribute delete

347 Bytes

license: apache-2.0
library_name: transformers
pipeline_tag: image-text-to-text

This repository contains the NativeRes-LLaVA model described in the paper Native Visual Understanding: Resolving Resolution Dilemmas in Vision-Language Models.

Code: https://github.com/Niujunbo2002/NativeRes-LLaVA