File size: 422 Bytes
992296a a0f3a8c 143ce55 1c2768d |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 |
---
license: mit
language:
- en
base_model:
- google/vit-base-patch16-224
- FacebookAI/roberta-base
tags:
- comics
- composition
- comic
- comic-analysis
- page
- fusion
---
The model code and documentation repository is at https://github.com/RichardScottOZ/comic-analysis
Using transformers multimodal fusion of image and text to make embeddings to query comics for similarity or text.
More more detail the repo above. |