File size: 4,759 Bytes
0e2c15c
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
# Methodology

## Problem Framing

Classify an input image into one of {rust, zinc, normal} using fast, interpretable color cues. We rely on CIELab chroma tendencies:

Rustish surfaces skew a* positive (reddish/brownish).

Zincish surfaces skew b* positive (yellowish).

The method is heuristic (no material guarantees) but is fast, parameter-tunable, and explainable.

## Pipeline Overview

Input → Preprocess → Color Analysis → Heuristic Ratios → Rule-based Classification → (Optional) Palette Visualization

Read & (optional) Resize

Load as BGR (cv.imread / bytes→cv.imdecode).

If max(H, W) > resize_max, downscale with INTER_AREA to speed up.

Color Space Transforms

BGR → RGB, BGR → HSV, BGR → Lab (OpenCV 8-bit Lab).

Compute per-space statistics (channel means/stds) for explainability.

Dominant Colors (K-Means)

Flatten pixels to N×3 (BGR float32).

Run cv.kmeans with KMEANS_PP_CENTERS, termination criteria (eps + max_iter).

Sort clusters by frequency to produce a palette (with shares) and a palette image.

Heuristic Indicators (Lab)

Split Lab: L*, a*, b*.

Compute medians: a_med, b_med.

Define thresholds:
a_thr = a_med + Δ, b_thr = b_med + Δ where Δ = lab_delta (default 6.0).

Ratios:
rustish_ratio = mean(a* > a_thr)
zincish_ratio = mean(b* > b_thr)

Rule-based Classification

If zincish_ratio > zinc_thr ⇒ zinc

Else if rustish_ratio > rust_thr ⇒ rust

Else ⇒ normal

Outputs

Class label, rustish_ratio, zincish_ratio

Top k RGB colors and their shares

(CLI) JSON report + palette image

## Parameters & Defaults

k (clusters): 3 — palette granularity vs. speed.

lab_delta (Δ): 6.0 — sensitivity around median; higher = stricter.

rust_thr: 0.01 — minimum fraction of above-threshold a* pixels.

zinc_thr: 0.02 — minimum fraction of above-threshold b* pixels.

resize_max: 1200 — longest side cap for speed (CLI).

Tuning tips:
• Increase lab_delta to reduce false positives.
• Decrease rust_thr/zinc_thr to increase sensitivity.
• Keep k=3..5 unless you need very fine palettes.

## Implementation Mapping

FastAPI (api.py): /classify/ accepts an image + query params → returns JSON with classification, ratios, and palette.

Gradio (gradio_app.py): Interactive UI with sliders for k, thresholds, and lab_delta.

CLI (reports):

color.py / main.py → stats + palette + heuristics (+ classification in color.py).

classify.py → reads a *_color_report.json and applies thresholds to output a label.

All core logic is shared: color conversions, K-Means, Lab heuristics, and the rule-based decision.

## Complexity & Performance

K-Means: ~O(N × k × iters), where N = pixels after optional resize.

Heuristics: O(N) on the Lab channels.

Typical runtime: Sub-second to a few seconds on commodity CPUs for ≤2MP images with k=3.

## Evaluation Protocol (Recommended)

Dataset: Curate labeled images for rust, zinc, normal under varied lighting/backgrounds.

Split: Train-free method; still use dev set to tune lab_delta, rust_thr, zinc_thr; hold out a test set.

Metrics: Accuracy, per-class precision/recall/F1; confusion matrix to catch “rust↔zinc” confusions.

Stress Tests: Vary illumination, white balance, exposure; add clutter/background metals; scale/resize sensitivity.

Ablations:

Different lab_delta values

Median vs. mean baselines for a*/b* thresholds

Removing K-Means (should not affect the classification, only explainability)

## Assumptions & Limitations

Heuristic, not material science: Color alone can’t guarantee presence of oxides/plating.

Lighting-dependent: Harsh color casts and shadows can skew ratios.

Background leakage: Non-metal regions may influence medians and ratios.

Specular highlights: Can wash out chroma; consider controlled lighting for best results.

Mitigations:

Encourage consistent, diffuse illumination and controlled background.

Optionally crop/segment to the object of interest.

Adjust lab_delta and thresholds per environment.

## Reproducibility

Dependencies: pinned in requirements.txt (use opencv-python-headless).

Determinism: OpenCV k-means with KMEANS_PP_CENTERS is typically stable; for strict reproducibility fix random seeds if you wrap k-means calls accordingly.

Reports: JSON artifacts and palette images are saved for auditability.

## Deployment Notes

API: uvicorn api:app --host 0.0.0.0 --port 8000; OpenAPI docs at /docs.

UI: python gradio_app.py for operator testing/threshold tuning.

Batch: Use color.py/main.py to generate reports; classify.py to re-classify with updated thresholds.

## Ethical Use

Intended for screening/triage in industrial contexts, not definitive corrosion/plating certification.

Users should verify with appropriate physical/chemical tests when decisions carry risk.