How to use from the
Use from the
Transformers library
# Use a pipeline as a high-level helper
from transformers import pipeline

pipe = pipeline("image-text-to-text", model="OmkarShidore/scene-caption")
# Load model directly
from transformers import AutoTokenizer, AutoModelForMultimodalLM

tokenizer = AutoTokenizer.from_pretrained("OmkarShidore/scene-caption")
model = AutoModelForMultimodalLM.from_pretrained("OmkarShidore/scene-caption")
Quick Links
README.md exists but content is empty.
Downloads last month
2
Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support

Space using OmkarShidore/scene-caption 1