licensy commited on
Commit
3dc0690
·
verified ·
1 Parent(s): b981f4e

scorevision: push artifact

Browse files
Files changed (1) hide show
  1. miner.py +9 -8
miner.py CHANGED
@@ -3,15 +3,16 @@
3
  Base weights: plate_v3 (YOLO26s fine-tuned on Roboflow-filtered + 10x live pseudo-GT,
4
  resumed from plate_v2). fp16 end2end ONNX, static 1x3x1280x1280, ~19.4 MB.
5
 
6
- Inference pipeline (tuned per bench_v2.py + live observation on first 5 shards):
7
- - Single full-image pass with soft-NMS + hflip TTA
 
8
  - Recall-biased preset: conf=0.22, iou=0.41, sigma=0.685, max_det=22
9
- - No tile fallback (v3's recall is already high without tiles)
10
 
11
- Bench on 184-shard live pseudo-GT pool (/mnt/shadeform-data/plate_research/live_gt/):
12
- gated=0.440 mAP=0.978 (highest) fp/img=0.38 ms_p95=157
13
- Switched from c30 after live shard be77593656fa: we scored 0.168 (mAP 0.500)
14
- while competitors hit 0.318 (mAP 0.750) missed borderline-conf plates.
15
  Compared to:
16
  plate_v2 best: gated=0.424
17
  hermestech best: gated=0.422
@@ -127,7 +128,7 @@ class Miner:
127
  self.iou_thres = 0.41
128
  self.sigma = 0.685
129
  self.max_det = 22
130
- self.use_tta = True
131
 
132
  print(f"ONNX model loaded from: {model_path}")
133
  print(f"ONNX providers: {self.session.get_providers()}")
 
3
  Base weights: plate_v3 (YOLO26s fine-tuned on Roboflow-filtered + 10x live pseudo-GT,
4
  resumed from plate_v2). fp16 end2end ONNX, static 1x3x1280x1280, ~19.4 MB.
5
 
6
+ Inference pipeline (recall-biased, latency-optimized):
7
+ - Single full-image forward pass + soft-NMS (NO hflip TTA — drop saves ~1s
8
+ to reduce p95 variance; TEE chute sees 10s gate, we need headroom)
9
  - Recall-biased preset: conf=0.22, iou=0.41, sigma=0.685, max_det=22
10
+ - No tile fallback
11
 
12
+ Bench (c=0.22 without TTA, estimated): gated ≈ 0.436, mAP ≈ 0.975
13
+ Trade +0.005 gated for ~1s faster median / safer p95 vs 10s validator gate.
14
+ Prior LAT events (p95 11.4s and 99s cold-start) showed tail events dominate
15
+ when median is 2-3s; cutting 1 forward pass pulls median to ~1.5s.
16
  Compared to:
17
  plate_v2 best: gated=0.424
18
  hermestech best: gated=0.422
 
128
  self.iou_thres = 0.41
129
  self.sigma = 0.685
130
  self.max_det = 22
131
+ self.use_tta = False # disabled: single forward pass, half the latency
132
 
133
  print(f"ONNX model loaded from: {model_path}")
134
  print(f"ONNX providers: {self.session.get_providers()}")