How to use from the
Use from the
Transformers library
# Use a pipeline as a high-level helper
from transformers import pipeline

pipe = pipeline("zero-shot-image-classification", model="dleemiller/siglip2-math-base-patch16-256")
pipe(
    "https://huggingface.co/datasets/huggingface/documentation-images/resolve/main/hub/parrots.png",
    candidate_labels=["animals", "humans", "landscape"],
)
# Load model directly
from transformers import AutoProcessor, AutoModelForZeroShotImageClassification

processor = AutoProcessor.from_pretrained("dleemiller/siglip2-math-base-patch16-256")
model = AutoModelForZeroShotImageClassification.from_pretrained("dleemiller/siglip2-math-base-patch16-256")
Quick Links

SigLip2 Math

This version of siglip2 is fine tuned on shiwk24/MathCanvas-Imagen using the code_derived_captions split. I trained for 1 epoch on 4M math images, with a random selection between the tikz code or caption using a batch size of 640.

This is not a classification model, since the loss function was pairwise contrastive loss. Use for embedding or downstream classifier training is recommended.

image

Downloads last month
6
Safetensors
Model size
0.4B params
Tensor type
F32
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for dleemiller/siglip2-math-base-patch16-256

Finetuned
(119)
this model

Dataset used to train dleemiller/siglip2-math-base-patch16-256