---
license: apache-2.0
language:
  - en
metrics:
  - accuracy
pipeline_tag: image-text-to-text
library_name: transformers
---

PyPE: Advancing General Multimodal Capability of Vision-language Models with Pyramid-descent Visual Position Encoding

For more details, please refer to the GitHub repository: PyPE.