---
pipeline_tag: image-to-text
tags:
- image-captioning
- anime
license: other
license_name: shadowlilac-extension-bsd-3
license_link: LICENSE
datasets:
- shadowlilac/anime
---
# Visor - Natural Language Anime Tagging
Visor is a natural-language image tagging model built on the BLIP architecture.
One potential use case is captioning anime images to build training data for diffusion models.
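
The captioning use case can be sketched as follows. This is a minimal illustration, not an official usage guide: it assumes the checkpoint loads with the standard BLIP classes in `transformers` (`BlipProcessor`, `BlipForConditionalGeneration`), and `MODEL_ID` is a hypothetical placeholder that must be replaced with the real repository id. Writing each caption to a `.txt` file next to its image is also an assumption, chosen because many diffusion fine-tuning scripts read captions that way.

```python
from pathlib import Path
from typing import Callable, List

# Hypothetical placeholder: replace with the actual Hub repository id of this model.
MODEL_ID = "your-namespace/visor"

# Image extensions to caption (an assumption; adjust to your dataset).
IMAGE_SUFFIXES = {".png", ".jpg", ".jpeg", ".webp"}


def make_blip_captioner(model_id: str = MODEL_ID, max_new_tokens: int = 64) -> Callable[[Path], str]:
    """Return a function that captions one image with a BLIP checkpoint."""
    # Imported lazily so the file-handling helper below works without these packages.
    from PIL import Image
    from transformers import BlipForConditionalGeneration, BlipProcessor

    processor = BlipProcessor.from_pretrained(model_id)
    model = BlipForConditionalGeneration.from_pretrained(model_id)

    def caption(image_path: Path) -> str:
        image = Image.open(image_path).convert("RGB")
        inputs = processor(images=image, return_tensors="pt")
        output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens)
        return processor.decode(output_ids[0], skip_special_tokens=True).strip()

    return caption


def write_caption_files(image_dir: Path, caption_fn: Callable[[Path], str]) -> List[Path]:
    """Write a .txt caption next to every image in image_dir; return the written paths."""
    written = []
    # sorted() materializes the listing first, so new .txt files are not re-visited.
    for image_path in sorted(Path(image_dir).iterdir()):
        if image_path.suffix.lower() not in IMAGE_SUFFIXES:
            continue
        caption_path = image_path.with_suffix(".txt")
        caption_path.write_text(caption_fn(image_path), encoding="utf-8")
        written.append(caption_path)
    return written
```

With `transformers`, `torch`, and `Pillow` installed, a call such as `write_caption_files(Path("dataset"), make_blip_captioner())` would caption every image in a hypothetical `dataset` folder. Passing the captioner in as a function keeps the file handling independent of the model, so it can be tried with a stub before downloading any weights.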