clip-vit-large-patch14-336

This model was trained using the clip-vit-large-patch14-336 vision and HiDream 248 Long CLIP It was tested working in comfy. You will need to set training to ignore the mismatched length as CLIP has no definition for the model.

def load_clip_with_long_context(model_path, max_length=248): print(f"Loading CLIP model with max_length = {max_length}...")

text_config = CLIPTextConfig.from_pretrained(model_path)
text_config.max_position_embeddings = max_length

model = CLIPModel.from_pretrained(
    model_path,
    ignore_mismatched_sizes=True,
    torch_dtype=torch.float32,
    text_config=text_config,
)
Downloads last month
8
Safetensors
Model size
0.4B params
Tensor type
F32
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support