tanganke/dtd
Viewer • Updated • 5.64k • 20.5k • 4
How to use tanganke/clip-vit-base-patch32_dtd with Transformers:
# Use a pipeline as a high-level helper
from transformers import pipeline
pipe = pipeline("feature-extraction", model="tanganke/clip-vit-base-patch32_dtd") # Load model directly
from transformers import AutoTokenizer, AutoModel
tokenizer = AutoTokenizer.from_pretrained("tanganke/clip-vit-base-patch32_dtd")
model = AutoModel.from_pretrained("tanganke/clip-vit-base-patch32_dtd")Adam Optimizer with a constant learning rate 1e-5 for 4000 steps training (batch_size=32). Only the vision encoder is fine-tuned.
load vision model
from transformers import CLIPVisionModel
vision_model = CLIPVisionModel.from_pretrained('tanganke/clip-vit-base-patch32_dtd')
substitute the vision encoder of clip
from transformers import CLIPModel
clip_model = CLIPModel.from_pretrained("openai/clip-vit-base-patch32")
clip_model.vision_model.load_state_dict(vision_model.vision_model.state_dict())
Base model
openai/clip-vit-base-patch32