scorevision: push artifact
Browse files
miner.py
CHANGED
|
@@ -3,15 +3,16 @@
|
|
| 3 |
Base weights: plate_v3 (YOLO26s fine-tuned on Roboflow-filtered + 10x live pseudo-GT,
|
| 4 |
resumed from plate_v2). fp16 end2end ONNX, static 1x3x1280x1280, ~19.4 MB.
|
| 5 |
|
| 6 |
-
Inference pipeline (
|
| 7 |
-
- Single full-image pass
|
|
|
|
| 8 |
- Recall-biased preset: conf=0.22, iou=0.41, sigma=0.685, max_det=22
|
| 9 |
-
- No tile fallback
|
| 10 |
|
| 11 |
-
Bench
|
| 12 |
-
|
| 13 |
-
|
| 14 |
-
|
| 15 |
Compared to:
|
| 16 |
plate_v2 best: gated=0.424
|
| 17 |
hermestech best: gated=0.422
|
|
@@ -127,7 +128,7 @@ class Miner:
|
|
| 127 |
self.iou_thres = 0.41
|
| 128 |
self.sigma = 0.685
|
| 129 |
self.max_det = 22
|
| 130 |
-
self.use_tta =
|
| 131 |
|
| 132 |
print(f"ONNX model loaded from: {model_path}")
|
| 133 |
print(f"ONNX providers: {self.session.get_providers()}")
|
|
|
|
| 3 |
Base weights: plate_v3 (YOLO26s fine-tuned on Roboflow-filtered + 10x live pseudo-GT,
|
| 4 |
resumed from plate_v2). fp16 end2end ONNX, static 1x3x1280x1280, ~19.4 MB.
|
| 5 |
|
| 6 |
+
Inference pipeline (recall-biased, latency-optimized):
|
| 7 |
+
- Single full-image forward pass + soft-NMS (NO hflip TTA — drop saves ~1s
|
| 8 |
+
to reduce p95 variance; TEE chute sees 10s gate, we need headroom)
|
| 9 |
- Recall-biased preset: conf=0.22, iou=0.41, sigma=0.685, max_det=22
|
| 10 |
+
- No tile fallback
|
| 11 |
|
| 12 |
+
Bench (c=0.22 without TTA, estimated): gated ≈ 0.436, mAP ≈ 0.975
|
| 13 |
+
Trade +0.005 gated for ~1s faster median / safer p95 vs 10s validator gate.
|
| 14 |
+
Prior LAT events (p95 11.4s and 99s cold-start) showed tail events dominate
|
| 15 |
+
when median is 2-3s; cutting 1 forward pass pulls median to ~1.5s.
|
| 16 |
Compared to:
|
| 17 |
plate_v2 best: gated=0.424
|
| 18 |
hermestech best: gated=0.422
|
|
|
|
| 128 |
self.iou_thres = 0.41
|
| 129 |
self.sigma = 0.685
|
| 130 |
self.max_det = 22
|
| 131 |
+
self.use_tta = False # disabled: single forward pass, half the latency
|
| 132 |
|
| 133 |
print(f"ONNX model loaded from: {model_path}")
|
| 134 |
print(f"ONNX providers: {self.session.get_providers()}")
|