physic76 commited on
Commit
c5bb601
·
verified ·
1 Parent(s): 2957f6f

Initial upload: yolo11n-seg ONNX (apple/fp16/fp8/int8 × 640x640)

Browse files
README.md ADDED
@@ -0,0 +1,86 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: agpl-3.0
3
+ library_name: onnx
4
+ tags:
5
+ - yolo
6
+ - yolov11
7
+ - object-detection
8
+ - instance-segmentation
9
+ - onnx
10
+ - tensorrt
11
+ pipeline_tag: image-segmentation
12
+ ---
13
+
14
+ # occurra/object_detection_segmentation
15
+
16
+ ONNX exports of [Ultralytics YOLOv11-seg](https://github.com/ultralytics/ultralytics)
17
+ (instance segmentation) in the configurations the occurra
18
+ `object_detection_segmentation` agent ships with. Companion to
19
+ [`occurra/object_detection`](https://huggingface.co/occurra/object_detection) —
20
+ same class set (person + bicycle + 4 vehicle subtypes), same naming
21
+ convention, same hardware-selection logic, with per-object pixel masks
22
+ on top of bounding boxes.
23
+
24
+ Nano size only (no small variant yet). Four precision variants. All
25
+ files are self-contained (no external-data sidecars).
26
+
27
+ ## Filename convention
28
+
29
+ ```
30
+ yolo11n-seg_{apple,fp16,fp8,int8}_640x640.onnx
31
+ ```
32
+
33
+ | Token | Meaning |
34
+ | ----- | ------- |
35
+ | `n-seg` | YOLOv11 nano segmentation variant |
36
+ | `apple` | FP16, NMS-free, batch=1, static — CoreML / Apple ANE friendly. uint8 input. |
37
+ | `fp16` | FP16 weights, NMS embedded. Default for NVIDIA `TensorRT` EP. |
38
+ | `fp8` | FP8 quantized via TensorRT QDQ. Smallest VRAM footprint on Blackwell / Hopper. |
39
+ | `int8` | INT8 quantized with QDQ nodes embedded in the graph. No sidecar calibration cache needed. |
40
+ | `640x640` | Square input — same shape used by the upstream Ultralytics export. |
41
+
42
+ The `object_detection_segmentation` agent reads the input shape directly
43
+ from the loaded ONNX (`graph.input[0].type`) — no sidecar config; the
44
+ file name is informational.
45
+
46
+ ## Which file to pick
47
+
48
+ | Hardware | Recommended |
49
+ | -------- | ----------- |
50
+ | Apple Silicon (CoreML / ANE) | `yolo11n-seg_apple_640x640.onnx` |
51
+ | NVIDIA RTX 4000+ / Blackwell | `yolo11n-seg_fp8_640x640.onnx` |
52
+ | NVIDIA older (no FP8) | `yolo11n-seg_int8_640x640.onnx` |
53
+ | CPU fallback | `yolo11n-seg_fp16_640x640.onnx` |
54
+
55
+ The agent's `_resolve_model_filename` picks automatically based on
56
+ platform + GPU compute capability. Set
57
+ `OBJECT_DETECTION_SEGMENTATION_MODEL=<filename>` to force a specific
58
+ variant.
59
+
60
+ ## Outputs
61
+
62
+ Each ONNX has two outputs (Ultralytics-seg standard):
63
+
64
+ | Output | Shape | Contents |
65
+ | ------ | ----- | -------- |
66
+ | `output0` | `(batch, 4+80+32, N)` | `[cx, cy, w, h]` + 80 class scores + 32 mask coefficients per anchor |
67
+ | `output1` | `(batch, 32, proto_h, proto_w)` | Prototype masks; `coeffs @ protos` reconstructs the per-detection mask. |
68
+
69
+ The agent runs NMS in Python after filtering to the curated class set
70
+ (COCO 0/1/2/3/5/7 → person, bicycle, car, motorcycle, bus, truck) and
71
+ decodes masks in `YoloSegOnnx`. Bitplane bytes are passed to the C++
72
+ toolbox for denoising + RLE encoding.
73
+
74
+ ## Source
75
+
76
+ Ultralytics `yolo11n-seg.pt` checkpoints downloaded from Ultralytics'
77
+ release feed and re-exported via the occurra toolbox's
78
+ `ai_agent_toolbox/agents/python/object_detection_segmentation/scripts/main.py`
79
+ (NMS-free for Apple, with-NMS for NVIDIA; FP8/INT8 use TensorRT QDQ).
80
+
81
+ ## License
82
+
83
+ The model weights inherit Ultralytics YOLOv11's
84
+ [AGPL-3.0](https://github.com/ultralytics/ultralytics/blob/main/LICENSE)
85
+ license. Commercial use requires a separate enterprise license from
86
+ Ultralytics — the ONNX export does not change that.
yolo11n-seg_apple_640x640.onnx ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:d160d5c6c2ca4917a38440d56be63e7a35b993e077a24abd6e0ad612f510b566
3
+ size 5942872
yolo11n-seg_fp16_640x640.onnx ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:6bde1f6003a4e997c3ccf2a087a703c7fdc54b734788655b78e2a87d8b15616e
3
+ size 6368251
yolo11n-seg_fp8_640x640.onnx ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:ac9829fb7047bb6520fa9c51d9da9b24607ac8ca75590727f1314d626c9a51e3
3
+ size 6527326
yolo11n-seg_int8_640x640.onnx ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:de42f8c1c9c263b97d802c218f8b0319980cb0c6eeb5f257592459d88a76247b
3
+ size 6561536