Sreekanth3096 commited on
Commit
e45a2a4
·
verified ·
1 Parent(s): 2a1df1b

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +0 -18
README.md CHANGED
@@ -1,21 +1,3 @@
1
- ---
2
- license: mit
3
- datasets:
4
- - ILSVRC/imagenet-1k
5
- language:
6
- - en
7
- tags:
8
- - image-classification
9
- - transformers
10
- - vit
11
- - vision transformer
12
- - tensorflow
13
- - vision
14
- metrics:
15
- - bleu
16
- library_name: transformers
17
- pipeline_tag: image-classification
18
- ---
19
 
20
  Model Overview:
21
  The Vision Transformer (ViT) is a transformer encoder model designed for image recognition tasks. It was pretrained on a large dataset of 14 million images and 21,843 classes known as ImageNet-21k, and fine-tuned on ImageNet 2012, which consists of 1 million images across 1,000 classes.
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
 
2
  Model Overview:
3
  The Vision Transformer (ViT) is a transformer encoder model designed for image recognition tasks. It was pretrained on a large dataset of 14 million images and 21,843 classes known as ImageNet-21k, and fine-tuned on ImageNet 2012, which consists of 1 million images across 1,000 classes.