hassonofer commited on
Commit
30bde84
·
verified ·
1 Parent(s): bd54730

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +68 -3
README.md CHANGED
@@ -1,3 +1,68 @@
1
- ---
2
- license: apache-2.0
3
- ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ tags:
3
+ - birder
4
+ - pytorch
5
+ license: apache-2.0
6
+ ---
7
+
8
+ # Model Card for CLIP-based Aesthetic Predictor
9
+
10
+ A simple MLP intended to run on CLIP embeddings to predict the "aesthetic quality" of an image (how much people like it on average).
11
+
12
+ Trained by Christoph Schuhmann and adapted to suit the [Vision Data Curation](https://gitlab.com/birder/vision-data-curation) project.
13
+
14
+ For more information see: <https://github.com/christophschuhmann/improved-aesthetic-predictor>
15
+
16
+ ## Model Details
17
+
18
+ - **Model Type:** Aesthetic score regression model
19
+ - **Input:** OpenAI CLIP embeddings ([vit_l14_pn_quick_gelu_openai-clip](https://huggingface.co/birder-project/vit_l14_pn_quick_gelu_openai-clip))
20
+ - **Output:** A score between 0 and 10, where higher values correspond to more aesthetic images
21
+
22
+ Original authorship: Adapted from Christoph Schuhmann's MLP Aesthetic Score Predictor
23
+
24
+ ## Model Usage
25
+
26
+ This classifier operates on CLIP image embeddings rather than raw pixels. To run inference with the Birder framework:
27
+
28
+ ```sh
29
+ # Download the CLIP backbone
30
+ python -m birder.tools download-model vit_l14_pn_quick_gelu_openai-clip
31
+
32
+ # Run prediction on a dataset
33
+ python -m birder.scripts.predict \
34
+ -n vit_l14_pn_quick_gelu \
35
+ -t openai-clip \
36
+ --simple-crop \
37
+ --gpu \
38
+ --parallel \
39
+ --batch-size 256 \
40
+ --chunk-size 50000 \
41
+ --amp \
42
+ --amp-dtype bfloat16 \
43
+ --save-logits \
44
+ --suffix optional-dataset-name \
45
+ path/to/dataset
46
+
47
+ # Can now run the aesthetic predictor on the saved logits
48
+ ```
49
+
50
+ ## Intended Use
51
+
52
+ Primary use case: Ranking or filtering images by aesthetic appeal, dataset curation, and training data selection.
53
+
54
+ Recommended scope: Research, dataset preparation, and large-scale data analysis.
55
+
56
+ Not intended for: As a measure of artistic merit, cultural value, or taste preferences of specific individuals.
57
+
58
+ ## Citation
59
+
60
+ ```bibtex
61
+ @misc{christophschuhmann2022improved-aesthetic-predictor,
62
+ author = {Christoph Schuhmann},
63
+ title = {MLP Aesthetic Score Predictor},
64
+ year = {2022},
65
+ url = {https://github.com/christophschuhmann/improved-aesthetic-predictor},
66
+ note = {Accessed: August 22, 2025},
67
+ }
68
+ ```