| license: apache-2.0 | |
| library_name: mlx | |
| tags: | |
| - image-classification | |
| - vision | |
| datasets: | |
| - imagenet | |
| - imagenet-1k | |
| # Data2Vec-Vision (large-sized model, fine-tuned on ImageNet-1k) | |
|  | |
| BEiT model pre-trained in a self-supervised fashion and fine-tuned on ImageNet-1k (1,2 million images, 1000 classes) at resolution 224x224. It was introduced in the paper [data2vec: A General Framework for Self-supervised Learning in Speech, Vision and Language](https://arxiv.org/abs/2202.03555) by Alexei Baevski, Wei-Ning Hsu, Qiantong Xu, Arun Babu, Jiatao Gu, Michael Auli and first released in [this repository](https://github.com/facebookresearch/data2vec_vision/tree/main/beit). | |
| ## Usage | |
| ```python | |
| from mlx_ssl.models import Data2VecVisionForImageClassification | |
| model = Data2VecVisionForImageClassification.from_pretrained( | |
| "mlx-community/data2vec-vision-large-ft1k" | |
| ) | |
| ``` |