keras/dinov2_base

KerasHub

prasadsachin committed on Feb 26

Commit f524393 (verified), 1 parent: 196e39b

Update README.md with new model card content

README.md +46 -20

@@ -1,23 +1,49 @@
 ---
 library_name: keras-hub
 ---
-This is a [`DINOV2` model](https://keras.io/api/keras_hub/models/dinov2) uploaded using the KerasHub library and can be used with JAX, TensorFlow, and PyTorch backends.
-Model config:
-* **name:** dinov2_backbone
-* **trainable:** True
-* **patch_size:** 14
-* **num_layers:** 12
-* **hidden_dim:** 768
-* **num_heads:** 12
-* **intermediate_dim:** 3072
-* **layer_scale_init_value:** 1.0
-* **num_register_tokens:** 0
-* **use_mask_token:** True
-* **use_swiglu_ffn:** False
-* **dropout_rate:** 0.0
-* **drop_path_rate:** 0.0
-* **image_shape:** [518, 518, 3]
-* **position_embedding_shape:** [518, 518]
-* **antialias_in_interpolation:** False
-This model card has been generated automatically and should be completed by the model author. See [Model Cards documentation](https://huggingface.co/docs/hub/model-cards) for more information.

+## Model Overview
+Vision Transformer (ViT) model trained using the DINOv2 method.
+**References**
+- [Learning Robust Visual Features without Supervision](https://arxiv.org/abs/2304.07193)
+- [Vision Transformers Need Registers](https://arxiv.org/abs/2309.16588)
+
+DINOv2 offers a powerful, generalist visual backbone learned entirely from
+unlabeled images, as described in "DINOv2: Learning Robust Visual Features
+without Supervision".
+## Links
+* [DINOv2 Quickstart Notebook] - coming soon
+* [DINOv2 API Documentation] - coming soon
+* [DINOv2 Beginner Guide] - coming soon
+* [KerasHub Model Publishing Guide](https://keras.io/guides/keras_hub/upload/)
+## Installation
+Keras and KerasHub can be installed with:
+```
+pip install -U -q keras-hub
+pip install -U -q keras
+```
+JAX, TensorFlow, and PyTorch come preinstalled in Kaggle Notebooks. For instructions on installing them in another environment, see the [Keras Getting Started](https://keras.io/getting_started/) page.
+## Presets
+The following model checkpoints are provided by the Keras team. Weights have been ported from: https://huggingface.co. Full code examples for each are available below.
+| Preset name | Parameters | Description |
+|---|---|---|
+| dinov2_small | 22.58M | Vision Transformer (small-sized model) trained using DINOv2. |
+| dinov2_base | 87.63M | Vision Transformer (base-sized model) trained using DINOv2. |
+| dinov2_large | 305.77M | Vision Transformer (large-sized model) trained using DINOv2. |
+| dinov2_giant | 1.13B | Vision Transformer (giant-sized model) trained using DINOv2. |
+| dinov2_with_registers_small | 22.58M | Vision Transformer (small-sized model) trained using DINOv2, with registers. |
+| dinov2_with_registers_base | 87.63M | Vision Transformer (base-sized model) trained using DINOv2, with registers. |
+| dinov2_with_registers_large | 305.77M | Vision Transformer (large-sized model) trained using DINOv2, with registers. |
+| dinov2_with_registers_giant | 1.13B | Vision Transformer (giant-sized model) trained using DINOv2, with registers. |
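
Editorial note, not part of the commit: the new model card lists presets but this excerpt shows no loading code. The following is a minimal sketch of how one of them might be loaded, not the authors' own example. It assumes the generic `keras_hub.models.Backbone.from_preset` entry point and the `hf://<owner>/<repo>` handle convention. The helper names `hf_handle` and `load_backbone` are hypothetical, and only `keras/dinov2_base` is confirmed as a repository by this page; the other handles are assumed to follow the same pattern.

```python
import os

# Keras 3 reads KERAS_BACKEND when it is first imported, so choose the
# backend before any keras import. "jax" is just an example choice.
os.environ.setdefault("KERAS_BACKEND", "jax")

# Preset names and parameter counts copied from the table above.
PRESET_PARAMS = {
    "dinov2_small": "22.58M",
    "dinov2_base": "87.63M",
    "dinov2_large": "305.77M",
    "dinov2_giant": "1.13B",
    "dinov2_with_registers_small": "22.58M",
    "dinov2_with_registers_base": "87.63M",
    "dinov2_with_registers_large": "305.77M",
    "dinov2_with_registers_giant": "1.13B",
}


def hf_handle(preset_name, owner="keras"):
    """Build a Hugging Face handle such as "hf://keras/dinov2_base".

    Assumption: every preset lives under the same owner as this repository.
    """
    if preset_name not in PRESET_PARAMS:
        raise ValueError(f"Unknown preset: {preset_name!r}")
    return f"hf://{owner}/{preset_name}"


def load_backbone(preset_name):
    """Download a preset and return its backbone (requires network access)."""
    # Imported lazily so the helpers above work without keras-hub installed.
    import keras_hub

    return keras_hub.models.Backbone.from_preset(hf_handle(preset_name))
```

With `keras-hub` installed, `load_backbone("dinov2_base")` would download the weights for this repository, and calling `.summary()` on the result prints the layer structure.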