Image Classification
LiteRT
LiteRT
vision
snnn001 commited on
Commit
8b2ef4b
·
verified ·
1 Parent(s): 2a7e380

Add LiteRT converted vit_tiny_patch16_224

Browse files
Files changed (2) hide show
  1. README.md +61 -0
  2. model.tflite +3 -0
README.md ADDED
@@ -0,0 +1,61 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ library_name: litert
3
+ base_model: timm/vit_tiny_patch16_224.augreg_in21k_ft_in1k
4
+ tags:
5
+ - vision
6
+ - image-classification
7
+ datasets:
8
+ - imagenet-1k
9
+ ---
10
+
11
+ # vit_tiny_patch16_224
12
+
13
+ Converted TIMM image classification model for LiteRT.
14
+
15
+ - Source architecture: vit_tiny_patch16_224
16
+ - File: model.tflite
17
+
18
+ ## Model Details
19
+
20
+ - **Model Type:** Image classification / feature backbone
21
+ - **Model Stats:**
22
+ - Params (M): 5.7
23
+ - GMACs: 1.1
24
+ - Activations (M): 4.1
25
+ - Image size: 224 x 224
26
+ - **Papers:**
27
+ - How to train your ViT? Data, Augmentation, and Regularization in Vision Transformers: https://arxiv.org/abs/2106.10270
28
+ - An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale: https://arxiv.org/abs/2010.11929v2
29
+ - **Dataset:** ImageNet-1k
30
+ - **Pretrain Dataset:** ImageNet-21k
31
+ - **Original:** https://github.com/google-research/vision_transformer
32
+
33
+ ## Citation
34
+
35
+ ```bibtex
36
+ @article{steiner2021augreg,
37
+ title={How to train your ViT? Data, Augmentation, and Regularization in Vision Transformers},
38
+ author={Steiner, Andreas and Kolesnikov, Alexander and and Zhai, Xiaohua and Wightman, Ross and Uszkoreit, Jakob and Beyer, Lucas},
39
+ journal={arXiv preprint arXiv:2106.10270},
40
+ year={2021}
41
+ }
42
+ ```
43
+ ```bibtex
44
+ @article{dosovitskiy2020vit,
45
+ title={An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale},
46
+ author={Dosovitskiy, Alexey and Beyer, Lucas and Kolesnikov, Alexander and Weissenborn, Dirk and Zhai, Xiaohua and Unterthiner, Thomas and Dehghani, Mostafa and Minderer, Matthias and Heigold, Georg and Gelly, Sylvain and Uszkoreit, Jakob and Houlsby, Neil},
47
+ journal={ICLR},
48
+ year={2021}
49
+ }
50
+ ```
51
+ ```bibtex
52
+ @misc{rw2019timm,
53
+ author = {Ross Wightman},
54
+ title = {PyTorch Image Models},
55
+ year = {2019},
56
+ publisher = {GitHub},
57
+ journal = {GitHub repository},
58
+ doi = {10.5281/zenodo.4414861},
59
+ howpublished = {\url{https://github.com/huggingface/pytorch-image-models}}
60
+ }
61
+ ```
model.tflite ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:1c3f3d1bd653a0ff761f0c1edd38c425b9797d6c71bf778cc10f987f3f7009f1
3
+ size 22978000