OJ-1 commited on
Commit
9a5bac7
·
verified ·
1 Parent(s): 5b259ab

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +12 -3
README.md CHANGED
@@ -3,8 +3,17 @@ license: agpl-3.0
3
  pipeline_tag: image-classification
4
  ---
5
 
6
- A tiny 230K param, opinionated image compression classifier. CNN. Trained on JPEG, JXL, AVIF, WEBP and PNG. Hits 94.3% accuracy on the task.
7
 
8
- Attempts to flags images below q=80 for JPEG and below 65 for the rest. This was driven by research showing image models trained on many q=75 JPEGS had massive performance penalties.
9
 
10
- Training was 128x128 patches + multi-scale high freq residuals. DCT aware 8x8 stem and dual heads (binary acceptable/not + codec classifier).
 
 
 
 
 
 
 
 
 
 
3
  pipeline_tag: image-classification
4
  ---
5
 
6
+ A small 2M param CNN classifier to predict the quality factor of lossy image codecs. Covers JPEG, JXL, AVIF and WEBP. Dataset (45k) included RAW photography and PNG illustrations.
7
 
8
+ Overall accuracy was 95.3% on train, 96.6% on val (7% split). Take the predictions with a large grain of salt, its not yet ready for production.
9
 
10
+ This model will *not* work properly in the following scenarios:
11
+
12
+ - Really small images (sub 512px)
13
+ - Mutiple resize and/or compression loops (would have ballooned training time for first iteration)
14
+ - <insert lossy format> renamed to PNG's
15
+ - Mixed media; low quality JPEG background with an overlaid illustration.
16
+
17
+ Motivated by research showing image models trained on many <= q=75 JPEGS had negative outcomes.
18
+
19
+ Credit: Rimuru original model and original code