File size: 876 Bytes
1b557d0 f38ae12 1b557d0 f38ae12 |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 |
---
license: apache-2.0
tags:
- self-supervised learning
- vision
- SiT
inference: false
---
# Model description
SiT is a self-supervised learning model that combines masked image modeling and contrastive learning. The model is trained on ImageNet-1K.
# Model Sources
- https://github.com/Sara-Ahmed/SiT
- https://arxiv.org/abs/2104.03602
# Model Card Authors
Sara Atito, Muhammad Awais, Josef Kittler
# How to use
```python
from modeling_sit import ViTSiTForPreTraining
# reload
model = ViTSiTForPreTraining.from_pretrained("erow/SiT")
```
# BibTeX entry and citation info
```
@inproceedings{atito2023sit,
title={SiT is all you need},
author={Atito, Sara and Awais, Muhammed and Nandam, Srinivasa and Kittler, Josef},
booktitle={2023 IEEE International Conference on Image Processing (ICIP)},
pages={2125--2129},
year={2023},
organization={IEEE}
}
``` |