---
license: apache-2.0
base_model: dhkim2810/MobileSAM
tags:
  - mask-generation
  - vision.cpp
pipeline_tag: image-segmentation
---
# GGUF models for MobileSAM

MobileSAM is a model for image segmentation. It generates object masks from
point or box prompts. The weights in this repository are converted for
lightweight inference on consumer hardware with
[vision.cpp](https://github.com/Acly/vision.cpp).

* Original repository: [ChaoningZhang/MobileSAM (GitHub)](https://github.com/ChaoningZhang/MobileSAM)
* Original weights: [dhkim2810/MobileSAM (Hugging Face)](https://huggingface.co/dhkim2810/MobileSAM)

## Run

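Example inference with [vision.cpp](https://github.com/Acly/vision.cpp). The
command below segments the object at the point prompt `256 480` in `input.png`,
writes the mask to `mask.png`, and writes a composite image to `output.png`: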

```sh
vision-cli sam -m MobileSAM-F16.gguf -i input.png -p 256 480 -o mask.png --composite output.png
```
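
To apply the same point prompt to several images, the command can be wrapped in a small shell function. This is a minimal sketch, not part of vision.cpp: `sam_point`, the `VISION_CLI` variable, and the `-mask.png` / `-composite.png` output naming are hypothetical, and it assumes that `-p` takes the pixel coordinates of the point as `x y`. It uses only the flags shown in the example above.

```sh
# Name of the executable. Override it (e.g. VISION_CLI=echo) for a dry run.
# VISION_CLI is a hypothetical variable introduced for this sketch.
VISION_CLI=${VISION_CLI:-vision-cli}

# Segment the object at point (x, y) in one image.
# Assumes the image path has a file extension; outputs are written next to
# the input as <stem>-mask.png and <stem>-composite.png.
sam_point() {
    model=$1 image=$2 x=$3 y=$4
    stem=${image%.*}   # strip the extension: photos/cat.png -> photos/cat
    "$VISION_CLI" sam -m "$model" -i "$image" -p "$x" "$y" \
        -o "$stem-mask.png" --composite "$stem-composite.png"
}

# Apply the same point prompt to every image passed on the command line.
for img in "$@"; do
    sam_point MobileSAM-F16.gguf "$img" 256 480
done
```

Saved as a script (for example `segment.sh`, a hypothetical name), it could be run as `sh segment.sh a.png b.png`. Setting `VISION_CLI=echo` prints the arguments of each call instead of running inference, which is a quick way to check the generated file names.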

## Models

| Model                                    | Description                                                      |
| ---------------------------------------- | ---------------------------------------------------------------- |
| [MobileSAM-F16.gguf](MobileSAM-F16.gguf) | Encoder + decoder, fused batch norm, NHWC memory layout, float16 |