large (ONNX)

This is an ONNX export and quantisation of google/siglip2-so400m-patch14-384, not an original model.

All credit for the original model goes to its authors. This repo exists solely to host pre-exported ONNX variants for use by Scene Atlas, a tabletop RPG scene management tool.

Source Model

Original: google/siglip2-so400m-patch14-384
Family: siglip2
Embedding dimension: 1152
Image size: 384x384

Quantisation

Encoder	Quantisation	Size
Vision	fp16	817.3 MB
Text	fp16	1350.5 MB
Total		2167.8 MB

Model Parameters

Parameter	Value
`model_family`	`siglip2`
`embedding_dim`	`1152`
`image_size`	`384`
`image_mean`	`0.5000, 0.5000, 0.5000`
`image_std`	`0.5000, 0.5000, 0.5000`
`interpolation`	`bilinear`
`resize_mode`	`direct_resize`
`tokenizer_type`	`sentencepiece`
`tokenizer_max_length`	`64`
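
The parameters above imply a simple preprocessing pipeline. What follows is a minimal sketch, not the Scene Atlas implementation. It assumes the vision encoder takes a float32 NCHW batch and that `direct_resize` means resizing straight to 384x384 with no cropping or aspect-ratio preservation; this card confirms neither. The function names are hypothetical, and `pad_id` is left as a required argument because the card does not state the padding token.

```python
import numpy as np

# Values copied from the Model Parameters table.
IMAGE_SIZE = 384
IMAGE_MEAN = np.array([0.5, 0.5, 0.5], dtype=np.float32)
IMAGE_STD = np.array([0.5, 0.5, 0.5], dtype=np.float32)
TOKENIZER_MAX_LENGTH = 64


def load_and_resize(path):
    """Open an image and resize it directly to IMAGE_SIZE x IMAGE_SIZE (bilinear)."""
    from PIL import Image  # Pillow >= 9.1 for Image.Resampling

    img = Image.open(path).convert("RGB")
    img = img.resize((IMAGE_SIZE, IMAGE_SIZE), Image.Resampling.BILINEAR)
    return np.asarray(img)


def normalise(pixels):
    """Map an HxWx3 uint8 array to a 1x3xHxW float32 array (layout is an assumption)."""
    scaled = pixels.astype(np.float32) / 255.0
    normed = (scaled - IMAGE_MEAN) / IMAGE_STD
    # HWC -> CHW, then add a batch dimension.
    return np.ascontiguousarray(normed.transpose(2, 0, 1)[np.newaxis, ...])


def pad_or_truncate(token_ids, pad_id, max_length=TOKENIZER_MAX_LENGTH):
    """Cut a token-ID sequence to max_length, then right-pad it with pad_id."""
    ids = list(token_ids)[:max_length]
    return ids + [pad_id] * (max_length - len(ids))
```

With a mean and standard deviation of 0.5 on every channel, normalised pixel values fall in the range [-1, 1].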

Usage

These models are intended for use with Scene Atlas. All files sit at the repo root: clip_vision_encoder.onnx, clip_text_encoder.onnx, manifest.json, and a tokenizer/ directory.

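from huggingface_hub import hf_hub_download

# Download both encoder files; each call returns the local cached path
vision_path = hf_hub_download(
    repo_id="jennis0/scene-atlas-large",
    filename="clip_vision_encoder.onnx",
)
text_path = hf_hub_download(
    repo_id="jennis0/scene-atlas-large",
    filename="clip_text_encoder.onnx",
)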
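
This card does not document the encoders' input or output tensor names. One way to find them is to load a file with ONNX Runtime and list what the session reports. The following is a rough sketch, assuming `onnxruntime` and `huggingface_hub` are installed; `describe_io` and `inspect_vision_encoder` are hypothetical helpers, not part of Scene Atlas.

```python
def describe_io(session):
    """Collect (name, shape) pairs for a session's inputs and outputs."""
    inputs = [(i.name, i.shape) for i in session.get_inputs()]
    outputs = [(o.name, o.shape) for o in session.get_outputs()]
    return {"inputs": inputs, "outputs": outputs}


def inspect_vision_encoder():
    """Download the vision encoder and print its tensor names and shapes."""
    import onnxruntime as ort
    from huggingface_hub import hf_hub_download

    path = hf_hub_download(
        repo_id="jennis0/scene-atlas-large",
        filename="clip_vision_encoder.onnx",
    )
    # CPU provider keeps the sketch portable; other providers are optional.
    session = ort.InferenceSession(path, providers=["CPUExecutionProvider"])
    print(describe_io(session))
```

Calling `inspect_vision_encoder()` downloads the vision encoder (about 817 MB) on first use. The same approach should work for clip_text_encoder.onnx by changing the filename.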

License

Please refer to the original model card for licensing information.
