| model | verdict | pixel max\|Δ\| | feature max\|Δ\| (rel) |
| --- | --- | --- | --- |
| openai/clip-vit-base-patch32 | OK | 7.53e-03 | 1.53e-02 (0.2%) |
| google/vit-base-patch16-224 | OK | 3.93e-03 | 6.87e-03 (0.8%) |
| apple/mobilevit-small | SKIP: no normalize (rescale only) | | |
| facebook/dinov2-small | OK | 1.41e-01 | 1.91e-02 (0.2%) |
| google/siglip-so400m-patch14-384 | OK | 1.58e-01 | 9.28e-03 (0.1%) |
| facebook/dinov3-vitb16-pretrain-lvd1689m | OK | 4.99e-05 | 8.08e-05 (0.0%) |
| microsoft/swinv2-tiny-patch4-window16-256 | OK | 8.75e-03 | 1.27e-02 (0.6%) |
| google/siglip2-base-patch16-224 | OK | 3.93e-03 | 1.31e-02 (0.2%) |
| microsoft/resnet-50 | SKIP: shortest_edge without crop (variable output) | | |
| nvidia/segformer-b0-finetuned-ade-512-512 | OK | 8.75e-03 | 4.33e-02 (0.4%) |
| facebook/convnextv2-tiny-22k-384 | SKIP: shortest_edge without crop (variable output) | | |
| google/mobilenet_v2_1.0_224 | OK | 3.92e-03 | 3.90e-02 (0.7%) |
| facebook/convnext-tiny-224 | SKIP: shortest_edge without crop (variable output) | | |
| google/efficientnet-b0 | SKIP: resample 0 | | |
| microsoft/beit-base-patch16-224-pt22k-ft22k | OK | 3.93e-03 | 3.19e-02 (0.9%) |
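
The table does not say how the `max|Δ|` and `(rel)` values are computed. As a rough sketch of one plausible reading: `max|Δ|` is the largest element-wise absolute difference between a reference tensor and a candidate tensor, and `(rel)` is that difference divided by the largest magnitude in the reference. The helper name `max_abs_diff` and the choice of normalizer are assumptions for illustration, not taken from the tool that produced these numbers.

```python
import numpy as np


def max_abs_diff(reference, candidate):
    """Return (max|Δ|, max|Δ| / max|reference|) for two same-shaped arrays.

    Assumption: the relative figure is normalized by the largest reference
    magnitude; the original report may use a different normalizer.
    """
    reference = np.asarray(reference, dtype=np.float64)
    candidate = np.asarray(candidate, dtype=np.float64)
    if reference.shape != candidate.shape:
        raise ValueError(
            f"shape mismatch: {reference.shape} vs {candidate.shape}"
        )

    # Largest element-wise absolute difference.
    abs_delta = float(np.max(np.abs(reference - candidate)))

    # Scale by the largest reference magnitude; guard against an all-zero reference.
    scale = float(np.max(np.abs(reference)))
    rel = abs_delta / scale if scale > 0 else 0.0
    return abs_delta, rel


# Toy example with made-up values (not data from the table).
ref = np.array([1.0, -2.0, 4.0])
cand = np.array([1.0, -2.1, 4.0])
abs_delta, rel = max_abs_diff(ref, cand)
print(f"{abs_delta:.2e} ({rel:.1%})")
```

Under this reading, a large pixel `max|Δ|` with a small relative feature difference would mean the preprocessing outputs differ at isolated pixels while the model's features stay close.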