---
license: mit
tags:
- onnx
- perceptual-hashing
- dinohash
---
# DINOHash — extra checkpoints
Additional DINOHash perceptual-hashing models not present in
[`backslashh/DINOHash`](https://huggingface.co/backslashh/DINOHash).
Each model is provided both as the **raw** training/traced artifact (`raw/`)
and as an exported **ONNX** graph (repo root, dynamic batch axis, opset 17).
| Model | ONNX | Raw | Notes |
|---|---|---|---|
| ViT-Small → ViT-Tiny (DINO distill) | `ViT-Small-ViT-Tiny.onnx` | `raw/ViT-Small-ViT-Tiny.pth` | student backbone (`vit_tiny_patch16_224`), 192-d embedding |
| XCiT-Small → XCiT-Tiny (DINO distill) | `XCiT-Small-XCiT-Tiny.onnx` | `raw/XCiT-Small-XCiT-Tiny.pth` | student backbone (`xcit_tiny_12_p16_224`), 192-d embedding |
| MAE-Lite mae_tiny_400e | `mae_tiny_400e_traced.onnx` | `raw/mae_tiny_400e_traced.pt` | 192-d |
| MAE-Lite mae_tiny_distill_400e | `mae_tiny_distill_400e_traced.onnx` | `raw/mae_tiny_distill_400e_traced.pt` | 192-d |
| MAE-Lite mae_tiny_distill_d2_400e | `mae_tiny_distill_d2_400e_traced.onnx` | `raw/mae_tiny_distill_d2_400e_traced.pt` | 192-d |
| MAE-Lite mocov3_tiny_400e | `mocov3_tiny_400e_traced.onnx` | `raw/mocov3_tiny_400e_traced.pt` | 192-d |
## Notes on the raw files
- **MAE-Lite** raw files are TorchScript archives (`*_traced.pt`). They are self-contained and can be loaded directly with `torch.jit.load`.
- **ViT / XCiT** raw files are full DINO training checkpoints (`student`/`teacher`/`optimizer`/...).
The ONNX graphs were built by loading the `student.backbone.*` weights into the matching
`timm` architecture (strict load, with no missing or unexpected keys) and exporting. XCiT additionally
required renaming `pos_embeder` to `pos_embed` and a qkv split/fuse between the class-attention and XCA blocks.
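
The key-extraction step described above can be sketched in plain Python on the checkpoint's state dict. This is a minimal illustration, not the script used to build these graphs: the helper name `extract_student_backbone`, the handling of an optional `module.` prefix (as left by a DistributedDataParallel wrapper), and the treatment of the `pos_embeder` rename as a simple substring replacement are all assumptions. The XCiT qkv split/fuse is not covered.

```python
from typing import Any, Dict, Mapping, Optional


def extract_student_backbone(
    checkpoint: Mapping[str, Any],
    renames: Optional[Mapping[str, str]] = None,
) -> Dict[str, Any]:
    """Return the student's backbone weights with their prefixes stripped.

    `checkpoint` is assumed to be a dict with a "student" state dict whose
    backbone keys look like "backbone.<name>" or "module.backbone.<name>".
    """
    renames = renames or {}
    backbone: Dict[str, Any] = {}
    for key, value in checkpoint["student"].items():
        # Assumption: DDP-wrapped checkpoints prefix every key with "module.".
        if key.startswith("module."):
            key = key[len("module."):]
        # Keep only backbone weights; anything else (e.g. a head) is dropped.
        if not key.startswith("backbone."):
            continue
        key = key[len("backbone."):]
        # Apply substring renames, e.g. {"pos_embeder": "pos_embed"} for XCiT.
        for old, new in renames.items():
            key = key.replace(old, new)
        backbone[key] = value
    return backbone
```

With PyTorch and `timm` installed, the checkpoint would typically come from `torch.load(path, map_location="cpu")`, and the returned dict could then be passed to `load_state_dict(..., strict=True)` on a model created with `timm.create_model("vit_tiny_patch16_224", num_classes=0)`. Whether that loads cleanly depends on the actual key layout of the checkpoint.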
All models take inputs of shape `(batch, 3, 224, 224)`.
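
As an illustration of this input contract, the sketch below builds a `(batch, 3, 224, 224)` float32 array with NumPy and runs it through one of the ONNX graphs with ONNX Runtime. The ImageNet mean/std normalization and the helper names `to_batch` and `embed` are assumptions. This README does not specify the preprocessing, so check it against the original training pipeline before relying on the outputs.

```python
import numpy as np

# Assumption: standard ImageNet statistics; the README does not state the preprocessing.
IMAGENET_MEAN = np.array([0.485, 0.456, 0.406], dtype=np.float32)
IMAGENET_STD = np.array([0.229, 0.224, 0.225], dtype=np.float32)


def to_batch(images_hwc_uint8):
    """Stack 224x224 RGB uint8 images (H, W, C) into a (batch, 3, 224, 224) float32 array."""
    batch = np.stack(images_hwc_uint8).astype(np.float32) / 255.0
    if batch.shape[1:] != (224, 224, 3):
        raise ValueError(f"expected 224x224 RGB images, got {batch.shape[1:]}")
    # Normalize per channel while the channel axis is still last.
    batch = (batch - IMAGENET_MEAN) / IMAGENET_STD
    # Move channels first: (N, H, W, C) -> (N, C, H, W).
    return np.ascontiguousarray(batch.transpose(0, 3, 1, 2))


def embed(onnx_path, batch):
    """Run a batch through an ONNX graph and return its first output."""
    import onnxruntime as ort  # imported lazily so to_batch works without it

    session = ort.InferenceSession(onnx_path, providers=["CPUExecutionProvider"])
    # Look the input name up from the graph rather than hard-coding it.
    input_name = session.get_inputs()[0].name
    return session.run(None, {input_name: batch})[0]
```

For example, `embed("ViT-Small-ViT-Tiny.onnx", to_batch(images))` would be expected to return one row per image. Because the batch axis is dynamic, any number of images can be stacked into one call.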