birder-project
/

davit_tiny_il-all

Image Classification

Model card Files Files and versions

hassonofer commited on Jan 23, 2025

Commit

aabe6f9

·

verified ·

1 Parent(s): bffb510

Update README.md

Files changed (1) hide show

README.md +6 -6

README.md CHANGED Viewed

@@ -8,7 +8,7 @@ license: apache-2.0
 # Model Card for davit_tiny_il-all
-A Dual Attention Vision Transformer (DaViT) image classification model. This model was trained on the `il-all` dataset (all the relevant bird species found in Israel inc. rarities).
 The species list is derived from data available at <https://www.israbirding.com/checklist/>.
@@ -16,12 +16,12 @@ The species list is derived from data available at <https://www.israbirding.com/
 - **Model Type:** Image classification and detection backbone
 - **Model Stats:**
-  - Params (M): 28.0
-  - Input image size: 384 x 384
 - **Dataset:** il-all (550 classes)
 - **Papers:**
-  - DaViT: Dual Attention Vision Transformers: <https://arxiv.org/abs/2204.03645>
 ## Model Usage
@@ -39,9 +39,9 @@ size = birder.get_size_from_signature(signature)
 # Create an inference transform
 transform = birder.classification_transform(size, rgb_stats)
-image = "path/to/image.jpeg"  # or a PIL image
 (out, _) = infer_image(net, image, transform)
-# out is a NumPy array with shape of (1, num_classes)
 ```
 ### Image Embeddings

 # Model Card for davit_tiny_il-all
+A Dual Attention Vision Transformer (DaViT) image classification model. This model was trained on the `il-all` dataset, encompassing all relevant bird species found in Israel, including rarities.
 The species list is derived from data available at <https://www.israbirding.com/checklist/>.
 - **Model Type:** Image classification and detection backbone
 - **Model Stats:**
+    - Params (M): 28.0
+    - Input image size: 384 x 384
 - **Dataset:** il-all (550 classes)
 - **Papers:**
+    - DaViT: Dual Attention Vision Transformers: <https://arxiv.org/abs/2204.03645>
 ## Model Usage
 # Create an inference transform
 transform = birder.classification_transform(size, rgb_stats)
+image = "path/to/image.jpeg"  # or a PIL image, must be loaded in RGB format
 (out, _) = infer_image(net, image, transform)
+# out is a NumPy array with shape of (1, num_classes), representing class probabilities.
 ```
 ### Image Embeddings