Instructions to use occurra/object_detection_segmentation with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- TensorRT
How to use occurra/object_detection_segmentation with TensorRT:
# No code snippets available yet for this library. # To use this model, check the repository files and the library's documentation. # Want to help? PRs adding snippets are welcome at: # https://github.com/huggingface/huggingface.js
- Notebooks
- Google Colab
- Kaggle
occurra/object_detection_segmentation
ONNX exports of Ultralytics YOLOv11-seg
(instance segmentation) in the configurations the occurra
object_detection_segmentation agent ships with. Companion to
occurra/object_detection โ
same class set (person + bicycle + 4 vehicle subtypes), same naming
convention, same hardware-selection logic, with per-object pixel masks
on top of bounding boxes.
Nano size only (no small variant yet). Four precision variants. All files are self-contained (no external-data sidecars).
Filename convention
yolo11n-seg_{apple,fp16,fp8,int8}_640x640.onnx
| Token | Meaning |
|---|---|
n-seg |
YOLOv11 nano segmentation variant |
apple |
FP16, NMS-free, batch=1, static โ CoreML / Apple ANE friendly. uint8 input. |
fp16 |
FP16 weights, NMS embedded. Default for NVIDIA TensorRT EP. |
fp8 |
FP8 quantized via TensorRT QDQ. Smallest VRAM footprint on Blackwell / Hopper. |
int8 |
INT8 quantized with QDQ nodes embedded in the graph. No sidecar calibration cache needed. |
640x640 |
Square input โ same shape used by the upstream Ultralytics export. |
The object_detection_segmentation agent reads the input shape directly
from the loaded ONNX (graph.input[0].type) โ no sidecar config; the
file name is informational.
Which file to pick
| Hardware | Recommended |
|---|---|
| Apple Silicon (CoreML / ANE) | yolo11n-seg_apple_640x640.onnx |
| NVIDIA RTX 4000+ / Blackwell | yolo11n-seg_fp8_640x640.onnx |
| NVIDIA older (no FP8) | yolo11n-seg_int8_640x640.onnx |
| CPU fallback | yolo11n-seg_fp16_640x640.onnx |
The agent's _resolve_model_filename picks automatically based on
platform + GPU compute capability. Set
OBJECT_DETECTION_SEGMENTATION_MODEL=<filename> to force a specific
variant.
Outputs
Each ONNX has two outputs (Ultralytics-seg standard):
| Output | Shape | Contents |
|---|---|---|
output0 |
(batch, 4+80+32, N) |
[cx, cy, w, h] + 80 class scores + 32 mask coefficients per anchor |
output1 |
(batch, 32, proto_h, proto_w) |
Prototype masks; coeffs @ protos reconstructs the per-detection mask. |
The agent runs NMS in Python after filtering to the curated class set
(COCO 0/1/2/3/5/7 โ person, bicycle, car, motorcycle, bus, truck) and
decodes masks in YoloSegOnnx. Bitplane bytes are passed to the C++
toolbox for denoising + RLE encoding.
Source
Ultralytics yolo11n-seg.pt checkpoints downloaded from Ultralytics'
release feed and re-exported via the occurra toolbox's
ai_agent_toolbox/agents/python/object_detection_segmentation/scripts/main.py
(NMS-free for Apple, with-NMS for NVIDIA; FP8/INT8 use TensorRT QDQ).
License
The model weights inherit Ultralytics YOLOv11's AGPL-3.0 license. Commercial use requires a separate enterprise license from Ultralytics โ the ONNX export does not change that.
# No code snippets available yet for this library. # To use this model, check the repository files and the library's documentation. # Want to help? PRs adding snippets are welcome at: # https://github.com/huggingface/huggingface.js