# Model Description

This is a ViT based style classifier used for Pony V7 captioning.

See the [captioning colab](https://colab.research.google.com/drive/19PG-0ltob8EynxUZSwOdjMFmqyJ7ZOCB#scrollTo=1ZwQT6sZaJpE) for usage details.