RichardScottOZ's picture
readme note
1c2768d verified
metadata
license: mit
language:
  - en
base_model:
  - google/vit-base-patch16-224
  - FacebookAI/roberta-base
tags:
  - comics
  - composition
  - comic
  - comic-analysis
  - page
  - fusion

The model code and documentation repository is at https://github.com/RichardScottOZ/comic-analysis

Using transformers multimodal fusion of image and text to make embeddings to query comics for similarity or text.

More more detail the repo above.