Commit History

mask_decoder: replace rank-5 GatherElements with one-hot Mul+ReduceSum selection (GatherElements generates invalid WGSL in ort-web WebGPU EP). Bit-identical outputs verified vs previous decoder.
3b2984d
verified

square-zero-labs commited on

vision_encoder: precompute static pos-embed (removes If/Tile constructs that broke ort-web strict shape inference; loads at graphOptimizationLevel 'all' now). Revalidated: worst frame IoU 0.9967
028ef32
verified

square-zero-labs commited on

SAM2.1-tiny full video tracking pipeline (validated ONNX export, worst frame IoU 0.9967 vs PyTorch reference)
a4144e8
verified

square-zero-labs commited on