File size: 506 Bytes
cae4de0 |
1 2 3 4 5 6 7 8 9 10 11 |
---
pipeline_tag: image-feature-extraction
library_name: transformers
license: apache-2.0
---
This repository contains the OpenVision model, a fully-open and cost-effective family of advanced vision encoders for multimodal learning, as described in the paper [OpenVision: A Fully-Open, Cost-Effective Family of Advanced Vision Encoders for Multimodal Learning](https://huggingface.co/papers/2505.04601).
Project Page: https://ucsc-vlaa.github.io/OpenVision/
Code: https://github.com/UCSC-VLAA/OpenVision |