medium (ONNX)

This is an ONNX export and quantisation of google/siglip2-base-patch16-224, not an original model.

All credit for the original model goes to its authors. This repo exists solely to host pre-exported ONNX variants for use by Scene Atlas, a tabletop RPG scene management tool.

Source Model

Original: google/siglip2-base-patch16-224
Family: siglip2
Embedding dimension: 768
Image size: 224x224

Quantisation

Encoder	Quantisation	Size
Vision	fp16	177.4 MB
Text	int8	290.6 MB
Total		468.0 MB

Model Parameters

Parameter	Value
`model_family`	`siglip2`
`embedding_dim`	`768`
`image_size`	`224`
`image_mean`	`0.5000, 0.5000, 0.5000`
`image_std`	`0.5000, 0.5000, 0.5000`
`interpolation`	`bilinear`
`resize_mode`	`direct_resize`
`tokenizer_type`	`sentencepiece`
`tokenizer_max_length`	`64`

Usage

These models are intended for use with Scene Atlas. The repo contains clip_vision_encoder.onnx, clip_text_encoder.onnx, manifest.json, and a tokenizer/ directory — all at the repo root.

from huggingface_hub import hf_hub_download

# Download encoder files
hf_hub_download(
    repo_id="jennis0/scene-atlas-medium",
    filename="clip_vision_encoder.onnx",
)

License

Please refer to the original model card for licensing information.

Downloads last month: -; Downloads are not tracked for this model. How to track

Inference Providers NEW

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for jennis0/scene-atlas-medium

Base model

google/siglip2-base-patch16-224

Quantized

(8)

this model

Collection including jennis0/scene-atlas-medium

Scene Atlas Models

Collection

ONNX-exported and quantised vision-language models for Scene Atlas. These are derivative works — see individual model cards for attribution. • 3 items • Updated Mar 15