nightknocker's picture
Update README.md
cc9bbec verified
|
raw
history blame
209 Bytes
metadata
license: apache-2.0
tags:
  - vit

Vision Transformer (base-sized model)

Random weights are provided for the ViT model. During each step, the model selects a random subset of the masked image patches.