Add model citation and fine-tuning dataset note
#2
by smojitoy - opened
README.md
CHANGED
|
@@ -157,9 +157,11 @@ pose_labels/
|
|
| 157 |
|
| 158 |
or as a CSV with `image_path, pose` columns. Class counts are inherently imbalanced and are handled at the sampler level (see below).
|
| 159 |
|
| 160 |
-
|
| 161 |
|
| 162 |
-
|
|
|
|
|
|
|
| 163 |
|
| 164 |
### Training Procedure
|
| 165 |
|
|
@@ -303,27 +305,34 @@ Training objective: cross-entropy with label smoothing (0.1), optimized only ove
|
|
| 303 |
|
| 304 |
## Citation
|
| 305 |
|
| 306 |
-
|
| 307 |
-
|
| 308 |
-
<!--
|
| 309 |
-
If you use our model in your work, please cite the model and any associated paper.
|
| 310 |
|
| 311 |
**Model**
|
| 312 |
-
|
| 313 |
-
|
| 314 |
-
|
| 315 |
-
|
| 316 |
-
title = {DINOv2 8-Class Animal Pose Classifier},
|
| 317 |
-
version = {<version#>},
|
| 318 |
year = {2026},
|
| 319 |
-
url = {https://huggingface.co/imageomics/mmla-dino-pose}
|
|
|
|
| 320 |
}
|
| 321 |
```
|
| 322 |
-
-->
|
| 323 |
|
| 324 |
-
|
| 325 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 326 |
```
|
|
|
|
|
|
|
|
|
|
|
|
|
| 327 |
@article{oquab2023dinov2,
|
| 328 |
title = {DINOv2: Learning Robust Visual Features without Supervision},
|
| 329 |
author = {Oquab, Maxime and Darcet, Timoth{\'e}e and Moutakanni, Th{\'e}o and Vo, Huy V. and Szafraniec, Marc and Khalidov, Vasil and Fernandez, Pierre and Haziza, Daniel and Massa, Francisco and El-Nouby, Alaaeldin and others},
|
|
|
|
| 157 |
|
| 158 |
or as a CSV with `image_path, pose` columns. Class counts are inherently imbalanced and are handled at the sampler level (see below).
|
| 159 |
|
| 160 |
+
This model was fine-tuned on the MMLA pose dataset:
|
| 161 |
|
| 162 |
+
- Dataset: [imageomics/mmla-pose](https://huggingface.co/datasets/imageomics/mmla-pose)
|
| 163 |
+
|
| 164 |
+
The dataset contains cropped images of zebras from MMLA drone footage labeled with one of eight pose orientations: front, front-left, front-right, left, back-left, back, back-right, and right.
|
| 165 |
|
| 166 |
### Training Procedure
|
| 167 |
|
|
|
|
| 305 |
|
| 306 |
## Citation
|
| 307 |
|
| 308 |
+
If you use this model, please cite this model repository, the MMLA pose dataset, the associated CV4Animals workshop paper, and the underlying DINOv2 backbone.
|
|
|
|
|
|
|
|
|
|
| 309 |
|
| 310 |
**Model**
|
| 311 |
+
|
| 312 |
+
```bibtex
|
| 313 |
+
@software{imageomics_mmla_dino_pose_2026,
|
| 314 |
+
author = {Sun, Claire and Kline, Jenna and Pillai, Bharath and Berger-Wolf, Tanya},
|
| 315 |
+
title = {MMLA DINOv2 8-Class Animal Pose Classifier},
|
|
|
|
| 316 |
year = {2026},
|
| 317 |
+
url = {https://huggingface.co/imageomics/mmla-dino-pose},
|
| 318 |
+
note = {Fine-tuned on the MMLA pose dataset: https://huggingface.co/datasets/imageomics/mmla-pose}
|
| 319 |
}
|
| 320 |
```
|
|
|
|
| 321 |
|
| 322 |
+
**Dataset**
|
| 323 |
|
| 324 |
+
Please also cite the MMLA pose dataset:
|
| 325 |
+
```bibtex
|
| 326 |
+
@dataset{imageomics_mmla_pose_2026,
|
| 327 |
+
title = {MMLA Pose Dataset},
|
| 328 |
+
year = {2026},
|
| 329 |
+
url = {https://huggingface.co/datasets/imageomics/mmla-pose}
|
| 330 |
+
}
|
| 331 |
```
|
| 332 |
+
|
| 333 |
+
**Underlying backbone**
|
| 334 |
+
|
| 335 |
+
```bibtex
|
| 336 |
@article{oquab2023dinov2,
|
| 337 |
title = {DINOv2: Learning Robust Visual Features without Supervision},
|
| 338 |
author = {Oquab, Maxime and Darcet, Timoth{\'e}e and Moutakanni, Th{\'e}o and Vo, Huy V. and Szafraniec, Marc and Khalidov, Vasil and Fernandez, Pierre and Haziza, Daniel and Massa, Francisco and El-Nouby, Alaaeldin and others},
|