Felldude's picture
Update README.md
cecfc63 verified
---
language:
- en
license: cc-by-nc-nd-4.0
tags:
- image-to-image
- 16bit-reconstruction
- hdr
- custom-vision
datasets:
- custom-dataset-8bit-to-16bit
metrics:
- name: Median MAE
type: regression
value: 410
- name: LPIPS (Alex)
type: perceptual
value: 0.044
---
# 16bit-from-8bit Image Reconstruction Model
This model reconstructs **16-bit per channel images** from standard **8-bit input images**. It is trained on paired datasets and optimized to preserve color fidelity, structural consistency, and high-frequency detail.
- **Median MAE:** 410
- **LPIPS (Alex):** ~0.044 (60-image evaluation)
- **Architecture Update:** Added Leaky ReLU
- **Training Resolution:** 256Γ—256(46k Patches from Raw HDR images with 8,580 48bit synthetic images.)
- **Training Resolution:** 512Γ—512 (Hand Selected Dataset 2k)
- **Training Resolution:** 1024Γ—1024 (Hand Selected Dataset 500)
---
## Dataset
- **Total images:** 54,580
- **RAW patch images:** 46,000 @ 256x256 (~10 GB)
- **48-bit synthetic images:** 8,580 (~2 GB)
Addtional 10GB in Hand Selected RAW images, for the 512px and 1024px High Frequency Training Passes
---
### MAE Distribution (8-bit β†’ 16-bit reconstruction)
| MAE Range | Accuracy Comment | Percent (%) |
|------------ |--------------------------------------|------------ |
| β‰₯1000 | Occasionally visible in uniform areas | 1.06 |
| 600–1000 | Almost never visible | 10.03 |
| 400–600 | Fully imperceptible | 27.39 |
| 200–400 | Near perfect | 59.95 |
| ≀200 | Near exact scientific | 1.57 |
### Perceptual Metrics (60-image test set)
| Metric | Result | Interpretation |
|-------------------|--------|----------------|
| LPIPS (Alex) | 0.044 | Low perceptual distance / high similarity |
| Gradient Energy | 0.088 β†’ 0.108 | Preserved fine detail, slight sharpening |
| FFT Structure Score| 1.07 β†’ 1.23 | Improved high-frequency retention |
| Histogram Continuity | 11.2 β†’ 11.3 | Stable tonal distribution |
## Interpretation Summary
- LPIPS values (~0.03–0.07 range) indicate **high perceptual similarity**
- Structural metrics (FFT + gradients) show **consistent detail reconstruction**
- Histogram stability indicates **no major tonal drift between bit-depth conversions**
---
## Intended Use
**Primary Use Cases**
- Reconstruction of 16-bit per channel images from 8-bit input
- JPG & GIF post-processing and enhancement
- Archival and art restoration workflows
**Not Intended For**
- Lossless scientific measurement or precision tasks
- Medical AI enhancement