Image Classification
LiteRT
LiteRT
vision
snnn001 commited on
Commit
51a2552
·
verified ·
1 Parent(s): 4ca2fb4

Add LiteRT converted maxvit_tiny_rw_224

Browse files
Files changed (2) hide show
  1. README.md +66 -0
  2. model.tflite +3 -0
README.md ADDED
@@ -0,0 +1,66 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ library_name: litert
3
+ base_model: timm/maxvit_tiny_rw_224.sw_in1k
4
+ tags:
5
+ - vision
6
+ - image-classification
7
+ datasets:
8
+ - imagenet-1k
9
+ ---
10
+ # maxvit_tiny_rw_224
11
+
12
+ Converted TIMM image classification model for LiteRT.
13
+
14
+ - Source architecture: `maxvit_tiny_rw_224`
15
+ - Source checkpoint: `timm/maxvit_tiny_rw_224.sw_in1k`
16
+ - File: `model.tflite`
17
+ - Input: `float32` tensor in NCHW layout, shape `[1, 3, 224, 224]`
18
+ - Output: ImageNet-1K logits, shape `[1, 1000]`
19
+
20
+ ## Runtime Status
21
+
22
+ - CPU smoke test: passed with LiteRT `CompiledModel`.
23
+ - GPU delegation: currently blocked for this model by rank-5 tensor patterns in the GPU backend, mostly `RESHAPE`, `TRANSPOSE`, and related window/attention operations. The model is published as CPU-ready while GPU support is being improved.
24
+
25
+ ## Model Details
26
+
27
+ - **Model Type:** Image classification / feature backbone
28
+ - **Model Stats:**
29
+ - Params (M): 29.1
30
+ - GMACs: 5.1
31
+ - Activations (M): 33.1
32
+ - Image size: 224 x 224
33
+ - **Papers:**
34
+ - MaxViT: Multi-Axis Vision Transformer: https://arxiv.org/abs/2204.01697
35
+ - **Dataset:** ImageNet-1k
36
+
37
+ ## Citation
38
+
39
+ ```bibtex
40
+ @misc{rw2019timm,
41
+ author = {Ross Wightman},
42
+ title = {PyTorch Image Models},
43
+ year = {2019},
44
+ publisher = {GitHub},
45
+ journal = {GitHub repository},
46
+ doi = {10.5281/zenodo.4414861},
47
+ howpublished = {\url{https://github.com/huggingface/pytorch-image-models}}
48
+ }
49
+ ```
50
+ ```bibtex
51
+ @article{tu2022maxvit,
52
+ title={MaxViT: Multi-Axis Vision Transformer},
53
+ author={Tu, Zhengzhong and Talebi, Hossein and Zhang, Han and Yang, Feng and Milanfar, Peyman and Bovik, Alan and Li, Yinxiao},
54
+ journal={ECCV},
55
+ year={2022},
56
+ }
57
+ ```
58
+ ```bibtex
59
+ @article{dai2021coatnet,
60
+ title={CoAtNet: Marrying Convolution and Attention for All Data Sizes},
61
+ author={Dai, Zihang and Liu, Hanxiao and Le, Quoc V and Tan, Mingxing},
62
+ journal={arXiv preprint arXiv:2106.04803},
63
+ year={2021}
64
+ }
65
+ ```
66
+
model.tflite ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:d912f15f864f3b4ccdedfb8fb21f3658aba060914d2a75420c5b277b4eafc9c4
3
+ size 117852416