narugo1992
commited on
Update README.md
Browse files
README.md
CHANGED
|
@@ -14,6 +14,28 @@ The model used to predict the types of anime images, which includes the followin
|
|
| 14 |
* Bangumi: Screenshots from anime videos.
|
| 15 |
* Comic: Images of manga that contain a significant amount of text or panel sequences.
|
| 16 |
* Illustration: General anime illustrations.
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 17 |
|
| 18 |
| Model | FLOPs | Accuracy | Confusion Matrix | Description |
|
| 19 |
|:--------------------:|:------:|:--------:|:-------------------------------------------------------------------------------------------------------------------------:|----------------------------------------------------------------------------------|
|
|
@@ -23,24 +45,4 @@ The model used to predict the types of anime images, which includes the followin
|
|
| 23 |
| mobilenetv3_dist | 0.63G | 91.98% | [Confusion Matrix](https://huggingface.co/deepghs/anime_classification/blob/main/mobilenetv3_dist/plot_confusion.png) | Distrillated from caformer_s36_plus, using mobilenetv3_large_100 with focal loss |
|
| 24 |
| mobilenetv3_sce | 0.63G | 89.92% | [Confusion Matrix](https://huggingface.co/deepghs/anime_classification/blob/main/mobilenetv3_sce/plot_confusion.png) | Model: mobilenetv3_large_100 from timm, use SCELoss as loss function |
|
| 25 |
| mobilenetv3_sce_dist | 0.63G | 92.35% | [Confusion Matrix](https://huggingface.co/deepghs/anime_classification/blob/main/mobilenetv3_sce_dist/plot_confusion.png) | Distrillated from caformer_s36_plus, using mobilenetv3_large_100 with SCELoss |
|
| 26 |
-
| mobilevitv2_150 | 9.09G | 88.21% | [Confusion Matrix](https://huggingface.co/deepghs/anime_classification/blob/main/mobilevitv2_150/plot_confusion.png) | Model: mobilevitv2_150 from timm |
|
| 27 |
-
|
| 28 |
-
| Name | FLOPS | Params | Accuracy | AUC | Confusion | Labels |
|
| 29 |
-
|:----------------------------:|:-------:|:--------:|:----------:|:------:|:--------------------------------------------------------------------------------------------------------------------------:|:--------------------------------------------------------:|
|
| 30 |
-
| caformer_s36 | 22.10G | 37.22M | 88.19% | N/A | [confusion](https://huggingface.co/deepghs/anime_classification/blob/main/caformer_s36/plot_confusion.png) | `3d`, `bangumi`, `comic`, `illustration` |
|
| 31 |
-
| caformer_s36_plus | 22.10G | 37.22M | 93.47% | 0.9891 | [confusion](https://huggingface.co/deepghs/anime_classification/blob/main/caformer_s36_plus/plot_confusion.png) | `3d`, `bangumi`, `comic`, `illustration` |
|
| 32 |
-
| caformer_s36_v1.1_focal | 22.10G | 37.22M | 95.99% | 0.9967 | [confusion](https://huggingface.co/deepghs/anime_classification/blob/main/caformer_s36_v1.1_focal/plot_confusion.png) | `3d`, `bangumi`, `comic`, `illustration`, `not_painting` |
|
| 33 |
-
| caformer_s36_v1.2_focal | 22.10G | 37.22M | 97.23% | 0.9982 | [confusion](https://huggingface.co/deepghs/anime_classification/blob/main/caformer_s36_v1.2_focal/plot_confusion.png) | `3d`, `bangumi`, `comic`, `illustration`, `not_painting` |
|
| 34 |
-
| caformer_s36_v1.3_focal | 22.10G | 37.22M | 97.16% | 0.9982 | [confusion](https://huggingface.co/deepghs/anime_classification/blob/main/caformer_s36_v1.3_focal/plot_confusion.png) | `3d`, `bangumi`, `comic`, `illustration`, `not_painting` |
|
| 35 |
-
| caformer_s36_v1.4_focal | 22.10G | 37.22M | 95.82% | 0.9967 | [confusion](https://huggingface.co/deepghs/anime_classification/blob/main/caformer_s36_v1.4_focal/plot_confusion.png) | `3d`, `bangumi`, `comic`, `illustration`, `not_painting` |
|
| 36 |
-
| caformer_s36_v1.4_focal_fp32 | 22.10G | 37.22M | 95.98% | 0.9969 | [confusion](https://huggingface.co/deepghs/anime_classification/blob/main/caformer_s36_v1.4_focal_fp32/plot_confusion.png) | `3d`, `bangumi`, `comic`, `illustration`, `not_painting` |
|
| 37 |
-
| caformer_s36_v1 | 22.10G | 37.22M | 94.72% | 0.9934 | [confusion](https://huggingface.co/deepghs/anime_classification/blob/main/caformer_s36_v1/plot_confusion.png) | `3d`, `bangumi`, `comic`, `illustration`, `not_painting` |
|
| 38 |
-
| mobilenetv3 | 0.63G | 4.18M | 88.96% | N/A | [confusion](https://huggingface.co/deepghs/anime_classification/blob/main/mobilenetv3/plot_confusion.png) | `3d`, `bangumi`, `comic`, `illustration` |
|
| 39 |
-
| mobilenetv3_dist | 0.63G | 4.18M | 91.98% | 0.9879 | [confusion](https://huggingface.co/deepghs/anime_classification/blob/main/mobilenetv3_dist/plot_confusion.png) | `3d`, `bangumi`, `comic`, `illustration` |
|
| 40 |
-
| mobilenetv3_sce | 0.63G | 4.18M | 89.92% | 0.9786 | [confusion](https://huggingface.co/deepghs/anime_classification/blob/main/mobilenetv3_sce/plot_confusion.png) | `3d`, `bangumi`, `comic`, `illustration` |
|
| 41 |
-
| mobilenetv3_sce_dist | 0.63G | 4.18M | 92.35% | 0.9854 | [confusion](https://huggingface.co/deepghs/anime_classification/blob/main/mobilenetv3_sce_dist/plot_confusion.png) | `3d`, `bangumi`, `comic`, `illustration` |
|
| 42 |
-
| mobilenetv3_v1.2_dist | 0.63G | 4.18M | 96.53% | 0.9972 | [confusion](https://huggingface.co/deepghs/anime_classification/blob/main/mobilenetv3_v1.2_dist/plot_confusion.png) | `3d`, `bangumi`, `comic`, `illustration`, `not_painting` |
|
| 43 |
-
| mobilenetv3_v1.3_dist | 0.63G | 4.18M | 96.41% | 0.9973 | [confusion](https://huggingface.co/deepghs/anime_classification/blob/main/mobilenetv3_v1.3_dist/plot_confusion.png) | `3d`, `bangumi`, `comic`, `illustration`, `not_painting` |
|
| 44 |
-
| mobilenetv3_v1.4_dist | 0.63G | 4.18M | 94.77% | 0.9950 | [confusion](https://huggingface.co/deepghs/anime_classification/blob/main/mobilenetv3_v1.4_dist/plot_confusion.png) | `3d`, `bangumi`, `comic`, `illustration`, `not_painting` |
|
| 45 |
-
| mobilenetv3_v1_dist | 0.63G | 4.18M | 94.04% | 0.9928 | [confusion](https://huggingface.co/deepghs/anime_classification/blob/main/mobilenetv3_v1_dist/plot_confusion.png) | `3d`, `bangumi`, `comic`, `illustration`, `not_painting` |
|
| 46 |
-
| mobilevitv2_150 | 9.09G | 9.79M | 88.21% | N/A | [confusion](https://huggingface.co/deepghs/anime_classification/blob/main/mobilevitv2_150/plot_confusion.png) | `3d`, `bangumi`, `comic`, `illustration` |
|
|
|
|
| 14 |
* Bangumi: Screenshots from anime videos.
|
| 15 |
* Comic: Images of manga that contain a significant amount of text or panel sequences.
|
| 16 |
* Illustration: General anime illustrations.
|
| 17 |
+
* Not Painting: (Only available in new models) Any content that cannot be called a painting, such as artist promotional posts, game screenshots, chat logs, etc.
|
| 18 |
+
|
| 19 |
+
| Name | FLOPS | Params | Accuracy | AUC | Confusion | Labels |
|
| 20 |
+
|:-----------------------------:|:-------:|:--------:|:----------:|:------:|:---------------------------------------------------------------------------------------------------------------------------:|:--------------------------------------------------------:|
|
| 21 |
+
| caformer_s36_v1.4_focal_fixed | 22.10G | 37.22M | 95.85% | 0.9968 | [confusion](https://huggingface.co/deepghs/anime_classification/blob/main/caformer_s36_v1.4_focal_fixed/plot_confusion.png) | `3d`, `bangumi`, `comic`, `illustration`, `not_painting` |
|
| 22 |
+
| caformer_s36_v1.4_focal_fp32 | 22.10G | 37.22M | 95.98% | 0.9969 | [confusion](https://huggingface.co/deepghs/anime_classification/blob/main/caformer_s36_v1.4_focal_fp32/plot_confusion.png) | `3d`, `bangumi`, `comic`, `illustration`, `not_painting` |
|
| 23 |
+
| mobilenetv3_v1.4_dist | 0.63G | 4.18M | 94.77% | 0.9950 | [confusion](https://huggingface.co/deepghs/anime_classification/blob/main/mobilenetv3_v1.4_dist/plot_confusion.png) | `3d`, `bangumi`, `comic`, `illustration`, `not_painting` |
|
| 24 |
+
| caformer_s36_v1.4_focal | 22.10G | 37.22M | 95.82% | 0.9967 | [confusion](https://huggingface.co/deepghs/anime_classification/blob/main/caformer_s36_v1.4_focal/plot_confusion.png) | `3d`, `bangumi`, `comic`, `illustration`, `not_painting` |
|
| 25 |
+
| mobilenetv3_v1.3_dist | 0.63G | 4.18M | 96.41% | 0.9973 | [confusion](https://huggingface.co/deepghs/anime_classification/blob/main/mobilenetv3_v1.3_dist/plot_confusion.png) | `3d`, `bangumi`, `comic`, `illustration`, `not_painting` |
|
| 26 |
+
| caformer_s36_v1.3_focal | 22.10G | 37.22M | 97.16% | 0.9982 | [confusion](https://huggingface.co/deepghs/anime_classification/blob/main/caformer_s36_v1.3_focal/plot_confusion.png) | `3d`, `bangumi`, `comic`, `illustration`, `not_painting` |
|
| 27 |
+
| mobilenetv3_v1.2_dist | 0.63G | 4.18M | 96.53% | 0.9972 | [confusion](https://huggingface.co/deepghs/anime_classification/blob/main/mobilenetv3_v1.2_dist/plot_confusion.png) | `3d`, `bangumi`, `comic`, `illustration`, `not_painting` |
|
| 28 |
+
| caformer_s36_v1.2_focal | 22.10G | 37.22M | 97.23% | 0.9982 | [confusion](https://huggingface.co/deepghs/anime_classification/blob/main/caformer_s36_v1.2_focal/plot_confusion.png) | `3d`, `bangumi`, `comic`, `illustration`, `not_painting` |
|
| 29 |
+
| caformer_s36_v1.1_focal | 22.10G | 37.22M | 95.99% | 0.9967 | [confusion](https://huggingface.co/deepghs/anime_classification/blob/main/caformer_s36_v1.1_focal/plot_confusion.png) | `3d`, `bangumi`, `comic`, `illustration`, `not_painting` |
|
| 30 |
+
| mobilenetv3_v1_dist | 0.63G | 4.18M | 94.04% | 0.9928 | [confusion](https://huggingface.co/deepghs/anime_classification/blob/main/mobilenetv3_v1_dist/plot_confusion.png) | `3d`, `bangumi`, `comic`, `illustration`, `not_painting` |
|
| 31 |
+
| caformer_s36_v1 | 22.10G | 37.22M | 94.72% | 0.9934 | [confusion](https://huggingface.co/deepghs/anime_classification/blob/main/caformer_s36_v1/plot_confusion.png) | `3d`, `bangumi`, `comic`, `illustration`, `not_painting` |
|
| 32 |
+
| mobilenetv3_dist | 0.63G | 4.18M | 91.98% | 0.9879 | [confusion](https://huggingface.co/deepghs/anime_classification/blob/main/mobilenetv3_dist/plot_confusion.png) | `3d`, `bangumi`, `comic`, `illustration` |
|
| 33 |
+
| mobilenetv3_sce_dist | 0.63G | 4.18M | 92.35% | 0.9854 | [confusion](https://huggingface.co/deepghs/anime_classification/blob/main/mobilenetv3_sce_dist/plot_confusion.png) | `3d`, `bangumi`, `comic`, `illustration` |
|
| 34 |
+
| caformer_s36_plus | 22.10G | 37.22M | 93.47% | 0.9891 | [confusion](https://huggingface.co/deepghs/anime_classification/blob/main/caformer_s36_plus/plot_confusion.png) | `3d`, `bangumi`, `comic`, `illustration` |
|
| 35 |
+
| mobilevitv2_150 | 9.09G | 9.79M | 88.21% | N/A | [confusion](https://huggingface.co/deepghs/anime_classification/blob/main/mobilevitv2_150/plot_confusion.png) | `3d`, `bangumi`, `comic`, `illustration` |
|
| 36 |
+
| mobilenetv3 | 0.63G | 4.18M | 88.96% | N/A | [confusion](https://huggingface.co/deepghs/anime_classification/blob/main/mobilenetv3/plot_confusion.png) | `3d`, `bangumi`, `comic`, `illustration` |
|
| 37 |
+
| caformer_s36 | 22.10G | 37.22M | 88.19% | N/A | [confusion](https://huggingface.co/deepghs/anime_classification/blob/main/caformer_s36/plot_confusion.png) | `3d`, `bangumi`, `comic`, `illustration` |
|
| 38 |
+
| mobilenetv3_sce | 0.63G | 4.18M | 89.92% | 0.9786 | [confusion](https://huggingface.co/deepghs/anime_classification/blob/main/mobilenetv3_sce/plot_confusion.png) | `3d`, `bangumi`, `comic`, `illustration` |
|
| 39 |
|
| 40 |
| Model | FLOPs | Accuracy | Confusion Matrix | Description |
|
| 41 |
|:--------------------:|:------:|:--------:|:-------------------------------------------------------------------------------------------------------------------------:|----------------------------------------------------------------------------------|
|
|
|
|
| 45 |
| mobilenetv3_dist | 0.63G | 91.98% | [Confusion Matrix](https://huggingface.co/deepghs/anime_classification/blob/main/mobilenetv3_dist/plot_confusion.png) | Distrillated from caformer_s36_plus, using mobilenetv3_large_100 with focal loss |
|
| 46 |
| mobilenetv3_sce | 0.63G | 89.92% | [Confusion Matrix](https://huggingface.co/deepghs/anime_classification/blob/main/mobilenetv3_sce/plot_confusion.png) | Model: mobilenetv3_large_100 from timm, use SCELoss as loss function |
|
| 47 |
| mobilenetv3_sce_dist | 0.63G | 92.35% | [Confusion Matrix](https://huggingface.co/deepghs/anime_classification/blob/main/mobilenetv3_sce_dist/plot_confusion.png) | Distrillated from caformer_s36_plus, using mobilenetv3_large_100 with SCELoss |
|
| 48 |
+
| mobilevitv2_150 | 9.09G | 88.21% | [Confusion Matrix](https://huggingface.co/deepghs/anime_classification/blob/main/mobilevitv2_150/plot_confusion.png) | Model: mobilevitv2_150 from timm |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|