---
pipeline_tag: image-feature-extraction
library_name: transformers
license: apache-2.0
---

This repository contains the OpenVision model, a fully-open and cost-effective family of advanced vision encoders for multimodal learning, as described in the paper [OpenVision: A Fully-Open, Cost-Effective Family of Advanced Vision Encoders for Multimodal Learning](https://huggingface.co/papers/2505.04601).

Project Page: https://ucsc-vlaa.github.io/OpenVision/

Code: https://github.com/UCSC-VLAA/OpenVision