KerasHub
dinov2_base / README.md
prasadsachin's picture
Update README.md with new model card content
f524393 verified
---
library_name: keras-hub
---
### Model Overview
Vision Transformer (ViT) model trained using the DINOv2 method.
**Reference**
- [Learning Robust Visual Features without Supervision](https://arxiv.org/abs/2304.07193)
- [Vision Transformers Need Registers](https://arxiv.org/abs/2309.16588)
DINOV2 offers a powerful, generalist visual backbone learned entirely from
unlabeled images as described in DINOv2: Learning Robust Visual Features
without Supervision
## Links
* [DINOv2 Quickstart Notebook] - coming soon
* [DINOv2 API Documentation] - coming soon
* [DINOv2 Beginner Guide] - coming soon
* [KerasHub Model Publishing Guide](https://keras.io/guides/keras_hub/upload/)
## Installation
Keras and KerasHub can be installed with:
```
pip install -U -q keras-hub
pip install -U -q keras
```
Jax, TensorFlow, and Torch come preinstalled in Kaggle Notebooks. For instructions on installing them in another environment see the [Keras Getting Started](https://keras.io/getting_started/) page.
## Presets
The following model checkpoints are provided by the Keras team. Weights have been ported from: https://huggingface.co. Full code examples for each are available below.
| Preset name | Parameters | Description |
|------------------------------------|------------|-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| dinov2_small | 22.58M | Vision Transformer (small-sized model) trained using DINOv2. |
| dinov2_base | 87.63M | Vision Transformer (base-sized model) trained using DINOv2. |
| dinov2_large | 305.77M | Vision Transformer (large-sized model) trained using DINOv2. |
| dinov2_giant | 1.13B | Vision Transformer (giant-sized model) trained using DINOv2.|
| dinov2_with_registers_small | 22.58M | Vision Transformer (small-sized model) trained using DINOv2, with registers. |
| dinov2_with_registers_base | 87.63M | Vision Transformer (base-sized model) trained using DINOv2, with registers. |
| dinov2_with_registers_large | 305.77M | Vision Transformer (large-sized model) trained using DINOv2, with registers. |
| dinov2_with_registers_giant | 1.13B | Vision Transformer (giant-sized model) trained using DINOv2, with registers.|