---
license: apache-2.0
tags:
- self-supervised learning
- vision
- SiT
inference: false

---

# Model description

SiT (Self-supervised vIsion Transformer) is a self-supervised vision model that combines masked image modeling with contrastive learning. It is pretrained on ImageNet-1K.
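
As a rough illustration of how a masked image modeling term and a contrastive term can be combined into one training objective, here is a minimal pure-Python sketch. It is not taken from the SiT code: the function names, the choice of mean absolute error for reconstruction, the InfoNCE form of the contrastive term, the temperature, and the `weight` factor are all assumptions made for illustration.

```python
import math


def masked_reconstruction_loss(original, reconstructed, mask):
    """Mean absolute error over masked positions only (mask[i] truthy = corrupted)."""
    errors = [abs(o - r) for o, r, m in zip(original, reconstructed, mask) if m]
    return sum(errors) / len(errors) if errors else 0.0


def _normalize(vector):
    """Scale a vector to unit length so dot products are cosine similarities."""
    norm = math.sqrt(sum(x * x for x in vector)) or 1.0
    return [x / norm for x in vector]


def info_nce_loss(view_a, view_b, temperature=0.2):
    """InfoNCE: embedding i of view_a should match embedding i of view_b.

    All other embeddings in view_b act as negatives for that anchor.
    """
    a = [_normalize(v) for v in view_a]
    b = [_normalize(v) for v in view_b]
    total = 0.0
    for i, anchor in enumerate(a):
        logits = [sum(x * y for x, y in zip(anchor, cand)) / temperature for cand in b]
        peak = max(logits)  # subtract the max for a numerically stable log-sum-exp
        log_sum_exp = peak + math.log(sum(math.exp(l - peak) for l in logits))
        total += log_sum_exp - logits[i]
    return total / len(a)


def combined_loss(original, reconstructed, mask, view_a, view_b, weight=1.0):
    """Hypothetical combined objective: reconstruction + weight * contrastive."""
    reconstruction = masked_reconstruction_loss(original, reconstructed, mask)
    contrastive = info_nce_loss(view_a, view_b)
    return reconstruction + weight * contrastive
```

In this sketch the reconstruction term only scores positions that were masked, while the contrastive term rewards matching embeddings of two views of the same image; the actual SiT objective and its weighting are described in the paper linked under Model Sources.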


# Model Sources

- https://github.com/Sara-Ahmed/SiT
- https://arxiv.org/abs/2104.03602


# Model Card Authors

Sara Atito, Muhammad Awais, Josef Kittler

# How to use

```python
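# Requires modeling_sit.py, which defines ViTSiTForPreTraining, to be importable
# (for example, placed in the working directory).
from modeling_sit import ViTSiTForPreTraining

# Load the pretrained checkpoint "erow/SiT".
model = ViTSiTForPreTraining.from_pretrained("erow/SiT")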
```

# BibTeX entry and citation info

```bibtex
@inproceedings{atito2023sit,
  title={SiT is all you need},
  author={Atito, Sara and Awais, Muhammed and Nandam, Srinivasa and Kittler, Josef},
  booktitle={2023 IEEE International Conference on Image Processing (ICIP)},
  pages={2125--2129},
  year={2023},
  organization={IEEE}
}
```