vietanhdev/segment-anything-onnx-models

@@ -1,101 +1,97 @@
----
-license: apache-2.0
-tags:
-  - image-segmentation
-  - segment-anything
-  - onnx
-  - onnxruntime
-library_name: onnxruntime
----
-# Segment Anything (SAM + MobileSAM) — ONNX Models
-ONNX-exported versions of Meta's [Segment Anything Model (SAM)](https://github.com/facebookresearch/segment-anything) and [MobileSAM](https://github.com/ChaoningZhang/MobileSAM), ready for CPU/GPU inference with [ONNX Runtime](https://onnxruntime.ai/) — no PyTorch required at runtime.
-These models are used by **[AnyLabeling](https://github.com/vietanhdev/anylabeling)** for AI-assisted image annotation, and exported by **[samexporter](https://github.com/vietanhdev/samexporter)**.
-## Available Models
-| File | Variant | Encoder size | Notes |
-|------|---------|-------------|-------|
-| `sam_vit_b_01ec64.zip` | SAM ViT-B | ~90 MB | Fastest, lowest accuracy |
-| `sam_vit_b_01ec64_quant.zip` | SAM ViT-B (Quant) | ~25 MB | Quantized — smaller & faster |
-| `sam_vit_l_0b3195.zip` | SAM ViT-L | ~330 MB | Good balance |
-| `sam_vit_l_0b3195_quant.zip` | SAM ViT-L (Quant) | ~83 MB | Quantized — smaller & faster |
-| `sam_vit_h_4b8939.zip` | SAM ViT-H | ~630 MB | Highest accuracy |
-| `sam_vit_h_4b8939_quant.zip` | SAM ViT-H (Quant) | ~158 MB | Quantized — smaller & faster |
-| `mobile_sam_20230629.zip` | MobileSAM | ~9 MB | Ultra-lightweight |
-Each zip contains two ONNX files: an **encoder** (runs once per image) and a **decoder** (runs interactively for each prompt).
-## Prompt Types
-- **Point** (`+point` / `-point`): click to include/exclude regions
-- **Rectangle**: draw a bounding box around the target object
-## Use with AnyLabeling (Recommended)
-[AnyLabeling](https://github.com/vietanhdev/anylabeling) is a desktop annotation tool with a built-in model manager that downloads, caches, and runs these models automatically — no coding required.
-1. Install: `pip install anylabeling`
-2. Launch: `anylabeling`
-3. Click the **Brain** button → select a SAM model from the dropdown
-4. Use point or rectangle prompts to segment objects
-[![AnyLabeling demo](https://user-images.githubusercontent.com/18329471/236625792-07f01838-3f69-48b0-a12e-30bad27bd921.gif)](https://github.com/vietanhdev/anylabeling)
-## Use Programmatically with ONNX Runtime
-```python
-import urllib.request, zipfile, pathlib
-# Download and extract
-url = "https://huggingface.co/vietanhdev/segment-anything-onnx-models/resolve/main/sam_vit_b_01ec64.zip"
-urllib.request.urlretrieve(url, "sam_vit_b_01ec64.zip")
-with zipfile.ZipFile("sam_vit_b_01ec64.zip") as z:
-    z.extractall("sam_vit_b_01ec64")
-```
-Then use [samexporter](https://github.com/vietanhdev/samexporter)'s inference module:
-```bash
-pip install samexporter
-python -m samexporter.inference \
-    --encoder_model sam_vit_b_01ec64/sam_vit_b_encoder.onnx \
-    --decoder_model sam_vit_b_01ec64/sam_vit_b_decoder.onnx \
-    --image photo.jpg \
-    --prompt prompt.json \
-    --output result.png
-```
-## Re-export from Source
-To re-export or customize the models using [samexporter](https://github.com/vietanhdev/samexporter):
-```bash
-pip install samexporter
-# Export SAM ViT-H encoder + decoder
-python -m samexporter.export_encoder \
-    --checkpoint original_models/sam_vit_h_4b8939.pth \
-    --output output_models/sam_vit_h_4b8939.encoder.onnx \
-    --model-type vit_h --use-preprocess
-python -m samexporter.export_decoder \
-    --checkpoint original_models/sam_vit_h_4b8939.pth \
-    --output output_models/sam_vit_h_4b8939.decoder.onnx \
-    --model-type vit_h --return-single-mask
-# Or convert all SAM variants at once:
-bash convert_all_meta_sam.sh
-```
-## Related Repositories
-| Repo | Description |
-|------|-------------|
-| [vietanhdev/samexporter](https://github.com/vietanhdev/samexporter) | Export scripts, inference code, conversion tools |
-| [vietanhdev/anylabeling](https://github.com/vietanhdev/anylabeling) | Desktop annotation app powered by these models |
-| [facebookresearch/segment-anything](https://github.com/facebookresearch/segment-anything) | Original SAM by Meta |
-| [ChaoningZhang/MobileSAM](https://github.com/ChaoningZhang/MobileSAM) | Original MobileSAM |
-## License
-The ONNX models are derived from Meta's SAM and MobileSAM, both released under the **Apache 2.0** license.
-The export code is part of [samexporter](https://github.com/vietanhdev/samexporter), released under the **MIT** license.

+---
+license: apache-2.0
+pipeline_tag: image-segmentation
+library_name: onnx
+tags:
+  - onnxruntime
+  - onnx
+  - segment-anything
+  - image-segmentation
+  - edge-ai
+  - anylabeling
+authors:
+  - Viet-Anh Nguyen
+---
+# Segment Anything (SAM) — ONNX Models
+ONNX exports of Meta's original [Segment Anything](https://github.com/facebookresearch/segment-anything) family, plus [MobileSAM](https://github.com/ChaoningZhang/MobileSAM), packaged for direct use with [`onnxruntime`](https://onnxruntime.ai) and [AnyLabeling](https://github.com/vietanhdev/anylabeling).
+## Why this repo exists
+Running SAM from the original PyTorch checkpoints is heavy on a CPU-only laptop or an edge device. ONNX gives you a portable runtime with few dependencies: ONNX Runtime has bindings for Python, C++, and JavaScript, and builds for many mobile and embedded targets. These exports are the ones AnyLabeling consumes for its smart-labeling features.
+## Variants
+Each `.zip` bundles the encoder and decoder ONNX files for that backbone. The encoder runs once per image; the decoder runs once per prompt.
+| File | Backbone | Size | Notes |
+|---|---|---|---|
+| `mobile_sam_20230629.zip` | MobileSAM | 35 MB | Lightweight; suited to mobile / low-power devices |
+| `mobile_sam_20230629_quant.zip` | MobileSAM | 10.5 MB | Quantized MobileSAM; smallest download |
+| `sam_vit_b_01ec64.zip` | ViT-B | 332 MB | Base |
+| `sam_vit_b_01ec64_quant.zip` | ViT-B | 72 MB | Quantized base |
+| `sam_vit_l_0b3195.zip` | ViT-L | 1.1 GB | Large |
+| `sam_vit_l_0b3195_quant.zip` | ViT-L | 213 MB | Quantized large |
+| `sam_vit_h_4b8939.zip` | ViT-H | 2.3 GB | Huge — best quality |
+| `sam_vit_h_4b8939_quant.zip` | ViT-H | 422 MB | Quantized huge |
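To see how much quantization saves per backbone, the sizes in the table can be compared directly. The sketch below copies the figures from the table and assumes 1 GB = 1000 MB for the two largest archives; `SIZES_MB` and `shrink_factor` are illustrative names, not part of this repo. Under that assumption the ratio comes out at roughly 3x for MobileSAM and about 5x for the ViT backbones.

```python
# Archive sizes in MB as (full precision, quantized), copied from the table.
# Assumption: 1 GB is taken as 1000 MB for the ViT-L and ViT-H rows.
SIZES_MB = {
    "mobile_sam_20230629": (35.0, 10.5),
    "sam_vit_b_01ec64": (332.0, 72.0),
    "sam_vit_l_0b3195": (1100.0, 213.0),
    "sam_vit_h_4b8939": (2300.0, 422.0),
}


def shrink_factor(full_mb, quant_mb):
    """Return how many times smaller the quantized archive is."""
    return full_mb / quant_mb


for name, (full_mb, quant_mb) in SIZES_MB.items():
    factor = shrink_factor(full_mb, quant_mb)
    print(f"{name}: {factor:.1f}x smaller when quantized")
```

These ratios describe download size only; they say nothing about the accuracy or speed difference between a full-precision and a quantized variant.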
+## Quick start
+```bash
+pip install huggingface_hub onnxruntime
+```
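+```python
+import zipfile
+from pathlib import Path
+import onnxruntime as ort
+from huggingface_hub import hf_hub_download
+# Download the archive into the local Hugging Face cache.
+zip_path = hf_hub_download(repo_id="vietanhdev/segment-anything-onnx-models",
+                           filename="sam_vit_b_01ec64_quant.zip")
+# Extract the encoder and decoder ONNX files.
+out_dir = Path("./sam_vit_b_quant")
+with zipfile.ZipFile(zip_path) as z:
+    z.extractall(out_dir)
+# File names inside the archive may differ between variants,
+# so locate the encoder by pattern instead of hard-coding its name.
+encoder_path = next(out_dir.rglob("*encoder*.onnx"))
+session = ort.InferenceSession(str(encoder_path),
+                               providers=["CPUExecutionProvider"])
+# Inspect expected inputs:
+print([(i.name, i.shape, i.type) for i in session.get_inputs()])
+```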
+For the full image → mask pipeline (encoder + decoder + prompt handling), see how AnyLabeling wires it: <https://github.com/vietanhdev/anylabeling>
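As a rough illustration of the prompt-handling step, the sketch below follows the prompt convention from Meta's official SAM ONNX export example. Labels are 1 for a foreground point, 0 for a background point, 2 and 3 for the top-left and bottom-right box corners, and a single padding point labelled -1 when no box is given. Coordinates are rescaled to the frame whose longest side is 1024 pixels. Whether the decoders in this repo expect exactly this layout is an assumption, so check the decoder's input names and shapes with `get_inputs()` first. `preprocess_shape` and `encode_prompts` are hypothetical helper names.

```python
def preprocess_shape(old_h, old_w, long_side=1024):
    """Return (new_h, new_w) after resizing so the longest side equals long_side."""
    scale = long_side / max(old_h, old_w)
    return int(old_h * scale + 0.5), int(old_w * scale + 0.5)


def encode_prompts(points, box, image_hw, long_side=1024):
    """Build (coords, labels) lists in the assumed SAM ONNX prompt convention.

    points:   list of (x, y, is_foreground) in original-image pixels.
    box:      (x0, y0, x1, y1) in original-image pixels, or None.
    image_hw: (height, width) of the original image.
    """
    old_h, old_w = image_hw
    new_h, new_w = preprocess_shape(old_h, old_w, long_side)
    sx, sy = new_w / old_w, new_h / old_h  # per-axis scale factors

    coords = [[x * sx, y * sy] for x, y, _ in points]
    labels = [1.0 if fg else 0.0 for _, _, fg in points]

    if box is None:
        # No box prompt: append one padding point with label -1.
        coords.append([0.0, 0.0])
        labels.append(-1.0)
    else:
        # A box is encoded as two corner points with labels 2 and 3.
        x0, y0, x1, y1 = box
        coords += [[x0 * sx, y0 * sy], [x1 * sx, y1 * sy]]
        labels += [2.0, 3.0]
    return coords, labels


# One foreground click on a 2048x1024 (width x height) image, no box.
coords, labels = encode_prompts([(200, 100, True)], None, image_hw=(1024, 2048))
print(coords, labels)
```

Before being fed to a decoder session, these lists would typically be converted to `float32` arrays with a leading batch dimension; the exact tensor names and shapes depend on how the model was exported.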
+## Use with AnyLabeling
+These models drop into AnyLabeling's auto-labeling backend without conversion. See the [AnyLabeling repository](https://github.com/vietanhdev/anylabeling) for the model-config wiring.
+## Source weights
+- Original SAM weights & license: <https://github.com/facebookresearch/segment-anything>
+- MobileSAM: <https://github.com/ChaoningZhang/MobileSAM>
+This repo redistributes the same weights in ONNX format. The license is unchanged from the upstream releases (Apache 2.0).
+## Citation
+```bibtex
+@misc{nguyen2026sam_onnx,
+  author = {Nguyen, Viet-Anh and {Neural Research Lab}},
+  title  = {Segment Anything ONNX Models},
+  year   = {2026},
+  url    = {https://huggingface.co/vietanhdev/segment-anything-onnx-models}
+}
+```
+For the underlying model, cite Meta's original SAM paper:
+```bibtex
+@article{kirillov2023sam,
+  title   = {Segment Anything},
+  author  = {Kirillov, Alexander and Mintun, Eric and Ravi, Nikhila and Mao, Hanzi and Rolland, Chloe and Gustafson, Laura and Xiao, Tete and Whitehead, Spencer and Berg, Alexander C. and Lo, Wan-Yen and Doll{\'a}r, Piotr and Girshick, Ross},
+  journal = {arXiv:2304.02643},
+  year    = {2023}
+}
+```
+## Acknowledgments
+Thanks to Meta AI Research for releasing the SAM family, and to the MobileSAM team for their efficient distillation. This repo packages their work for edge inference.