How to use shadowlilac/visor with Transformers:
# Use a pipeline as a high-level helper # Warning: Pipeline type "image-to-text" is no longer supported in transformers v5. # You must load the model directly (see below) or downgrade to v4.x with: # 'pip install "transformers<5.0.0' from transformers import pipeline pipe = pipeline("image-to-text", model="shadowlilac/visor")
# Load model directly from transformers import AutoProcessor, AutoModelForImageTextToText processor = AutoProcessor.from_pretrained("shadowlilac/visor") model = AutoModelForImageTextToText.from_pretrained("shadowlilac/visor")
Visor is a natural-language-based image tagging model based on the BLIP model architecture.
Potential Use cases can be to caption anime images for training diffusion models
Files info