Makki2104 committed
Commit 0ba50d5 · verified · 1 Parent(s): d2f91cd

Add files using upload-large-folder tool
convnextv2_huge.dbv4-full/README.md ADDED
@@ -0,0 +1,148 @@
---
tags:
- image-classification
- timm
- transformers
- animetimm
- dghs-imgutils
library_name: timm
license: gpl-3.0
datasets:
- animetimm/danbooru-wdtagger-v4-w640-ws-full
base_model:
- timm/convnextv2_huge.fcmae_ft_in22k_in1k_512
---

# Anime Tagger convnextv2_huge.dbv4-full

## Model Details

- **Model Type:** Multilabel image classification / feature backbone
- **Model Stats:**
  - Params: 692.6M
  - FLOPs / MACs: 1.2T / 600.4G
  - Image size: train = 512 x 512, test = 512 x 512
- **Dataset:** [animetimm/danbooru-wdtagger-v4-w640-ws-full](https://huggingface.co/datasets/animetimm/danbooru-wdtagger-v4-w640-ws-full)
  - Tags Count: 12476
    - General (#0) Tags Count: 9225
    - Character (#4) Tags Count: 3247
    - Rating (#9) Tags Count: 4

## Results

| # | Macro@0.40 (F1/MCC/P/R) | Micro@0.40 (F1/MCC/P/R) | Macro@Best (F1/P/R) |
|:----------:|:-----------------------------:|:-----------------------------:|:---------------------:|
| Validation | 0.580 / 0.584 / 0.626 / 0.556 | 0.697 / 0.696 / 0.692 / 0.701 | --- |
| Test | 0.580 / 0.584 / 0.627 / 0.556 | 0.697 / 0.696 / 0.693 / 0.702 | 0.611 / 0.612 / 0.630 |

* `Macro/Micro@0.40` denotes metrics computed at a fixed threshold of 0.40.
* `Macro@Best` denotes the mean of metrics computed with per-tag thresholds, each chosen to maximize that tag's F1 score.

## Thresholds

| Category | Name | Alpha | Threshold | Micro@Thr (F1/P/R) | Macro@0.40 (F1/P/R) | Macro@Best (F1/P/R) |
|:----------:|:---------:|:-------:|:-----------:|:---------------------:|:---------------------:|:---------------------:|
| 0 | general | 1 | 0.38 | 0.685 / 0.673 / 0.697 | 0.457 / 0.514 / 0.430 | 0.494 / 0.490 / 0.524 |
| 4 | character | 1 | 0.51 | 0.946 / 0.962 / 0.930 | 0.930 / 0.948 / 0.915 | 0.943 / 0.959 / 0.930 |
| 9 | rating | 1 | 0.24 | 0.828 / 0.790 / 0.871 | 0.833 / 0.823 / 0.843 | 0.835 / 0.812 / 0.861 |

* `Micro@Thr` denotes metrics computed at the category-level suggested thresholds listed above.
* `Macro@0.40` denotes metrics computed at a fixed threshold of 0.40.
* `Macro@Best` denotes metrics computed with per-tag thresholds, each chosen to maximize that tag's F1 score.

The tag-level thresholds are available in [selected_tags.csv](https://huggingface.co/animetimm/convnextv2_huge.dbv4-full/resolve/main/selected_tags.csv).

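Per-tag thresholding as described above can be sketched in a few lines. The tag names and threshold values below are synthetic stand-ins for rows of `selected_tags.csv`, not its real contents:

```python
# Hypothetical scores and per-tag "best" thresholds (stand-ins for the
# 'name' and 'best_threshold' columns of selected_tags.csv).
predictions = {'1girl': 0.97, 'smile': 0.41, 'holding': 0.30}
best_threshold = {'1girl': 0.50, 'smile': 0.38, 'holding': 0.33}

# Keep every tag whose score meets its own threshold.
kept = {tag: score for tag, score in predictions.items()
        if score >= best_threshold[tag]}
print(kept)  # {'1girl': 0.97, 'smile': 0.41}
```

The point of per-tag thresholds is that a score of 0.41 can be a confident positive for one tag and a clear negative for another.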
## How to Use

We provide a sample image for the code samples below; you can find it [here](https://huggingface.co/animetimm/convnextv2_huge.dbv4-full/blob/main/sample.webp).

### Use TIMM And Torch

Install [dghs-imgutils](https://github.com/deepghs/imgutils), [timm](https://github.com/huggingface/pytorch-image-models) and the other necessary requirements with the following command:

```shell
pip install 'dghs-imgutils>=0.19.0' torch huggingface_hub timm pillow pandas
```

After that, you can load this model with the timm library and use it for training, validation and testing with the following code:

```python
import json

import pandas as pd
import torch
from huggingface_hub import hf_hub_download
from imgutils.data import load_image
from imgutils.preprocess import create_torchvision_transforms
from timm import create_model

repo_id = 'animetimm/convnextv2_huge.dbv4-full'
model = create_model(f'hf-hub:{repo_id}', pretrained=True)
model.eval()

with open(hf_hub_download(repo_id=repo_id, repo_type='model', filename='preprocess.json'), 'r') as f:
    preprocessor = create_torchvision_transforms(json.load(f)['test'])
# Compose(
#     PadToSize(size=(512, 512), interpolation=bilinear, background_color=white)
#     Resize(size=(512, 512), interpolation=bicubic, max_size=None, antialias=True)
#     CenterCrop(size=[512, 512])
#     MaybeToTensor()
#     Normalize(mean=tensor([0.4850, 0.4560, 0.4060]), std=tensor([0.2290, 0.2240, 0.2250]))
# )

image = load_image('https://huggingface.co/animetimm/convnextv2_huge.dbv4-full/resolve/main/sample.webp')
input_ = preprocessor(image).unsqueeze(0)
# input_, shape: torch.Size([1, 3, 512, 512]), dtype: torch.float32
with torch.no_grad():
    output = model(input_)
    prediction = torch.sigmoid(output)[0]
# output, shape: torch.Size([1, 12476]), dtype: torch.float32
# prediction, shape: torch.Size([12476]), dtype: torch.float32

df_tags = pd.read_csv(
    hf_hub_download(repo_id=repo_id, repo_type='model', filename='selected_tags.csv'),
    keep_default_na=False,
)
tags = df_tags['name']
mask = prediction.numpy() >= df_tags['best_threshold']
print(dict(zip(tags[mask].tolist(), prediction[mask].tolist())))
# {'sensitive': 0.9900546073913574,
#  '1girl': 0.9986221790313721,
#  'solo': 0.9894072413444519,
#  'looking_at_viewer': 0.8689708113670349,
#  'blush': 0.8729097843170166,
#  'smile': 0.9395995736122131,
#  'short_hair': 0.6831153631210327,
#  'long_sleeves': 0.6779903173446655,
#  'brown_hair': 0.802174985408783,
#  'holding': 0.3276722729206085,
#  'dress': 0.6280677318572998,
#  'sitting': 0.6450996994972229,
#  'purple_eyes': 0.8072393536567688,
#  'flower': 0.9524818062782288,
#  'braid': 0.8764650225639343,
#  'outdoors': 0.47000938653945923,
#  'tears': 0.9879008531570435,
#  'floral_print': 0.5994200706481934,
#  'crying': 0.34614139795303345,
#  'plant': 0.3870095908641815,
#  'crown_braid': 0.7048561573028564,
#  'happy_tears': 0.759681224822998,
#  'pavement': 0.2870482802391052,
#  'wiping_tears': 0.9898664951324463,
#  'brick_floor': 0.5737900137901306}
```

## Citation

```
@misc{convnextv2_huge_dbv4_full,
  title = {Anime Tagger convnextv2_huge.dbv4-full},
  author = {narugo1992 and Deep Generative anime Hobbyist Syndicate (DeepGHS)},
  year = {2025},
  howpublished = {\url{https://huggingface.co/animetimm/convnextv2_huge.dbv4-full}},
  note = {A large-scale anime-style image classification model based on the convnextv2_huge architecture for multi-label tagging with 12476 tags, trained on the anime dataset dbv4-full (\url{https://huggingface.co/datasets/animetimm/danbooru-wdtagger-v4-w640-ws-full}). Model parameters: 692.6M, FLOPs: 1.2T, input resolution: 512×512.},
  license = {gpl-3.0}
}
```
convnextv2_huge.dbv4-full/preprocess.json ADDED
@@ -0,0 +1,101 @@
{
  "pre": [
    {
      "background_color": "white",
      "interpolation": "bilinear",
      "size": [512, 512],
      "type": "pad_to_size"
    }
  ],
  "test": [
    {
      "background_color": "white",
      "interpolation": "bilinear",
      "size": [512, 512],
      "type": "pad_to_size"
    },
    {
      "antialias": true,
      "interpolation": "bicubic",
      "max_size": null,
      "size": [512, 512],
      "type": "resize"
    },
    {
      "size": [512, 512],
      "type": "center_crop"
    },
    {
      "type": "maybe_to_tensor"
    },
    {
      "mean": [0.48500001430511475, 0.4560000002384186, 0.4059999883174896],
      "std": [0.2290000021457672, 0.2240000069141388, 0.22499999403953552],
      "type": "normalize"
    }
  ],
  "val": [
    {
      "background_color": "white",
      "interpolation": "bilinear",
      "size": [512, 512],
      "type": "pad_to_size"
    },
    {
      "antialias": true,
      "interpolation": "bicubic",
      "max_size": null,
      "size": [512, 512],
      "type": "resize"
    },
    {
      "size": [512, 512],
      "type": "center_crop"
    },
    {
      "type": "maybe_to_tensor"
    },
    {
      "mean": [0.48500001430511475, 0.4560000002384186, 0.4059999883174896],
      "std": [0.2290000021457672, 0.2240000069141388, 0.22499999403953552],
      "type": "normalize"
    }
  ]
}
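The `pad_to_size` stage above letterboxes the input onto a white 512×512 canvas before the resize step. A minimal geometry sketch of one plausible reading of that step (the authoritative implementation is `create_torchvision_transforms` in dghs-imgutils):

```python
def fit_and_pad(width, height, size=512):
    """Scale so the longer edge equals `size`, then center the result on a
    size x size canvas; returns (left, top, new_width, new_height)."""
    scale = size / max(width, height)
    new_w, new_h = round(width * scale), round(height * scale)
    return (size - new_w) // 2, (size - new_h) // 2, new_w, new_h

print(fit_and_pad(640, 480))  # (0, 64, 512, 384): letterboxed vertically
```

A square input maps to the full canvas with zero padding, which is why the subsequent 512 resize and center-crop are no-ops for square images.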
convnextv2_huge.dbv4-full/sample.webp ADDED
convnextv2_huge.dbv4-full/selected_tags.csv ADDED
The diff for this file is too large to render. See raw diff
 
convnextv2_huge.dbv4-full/thresholds.csv ADDED
@@ -0,0 +1,4 @@
category,name,alpha,threshold,f1,precision,recall
0,general,1.0,0.38,0.685090329194269,0.6732974998130495,0.6973036267335329
4,character,1.0,0.51,0.9457540360090104,0.9618129946021976,0.930222529196658
9,rating,1.0,0.24000000000000002,0.828248843557246,0.7895431723985823,0.8709450692041523
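The category-level suggested thresholds can be read straight out of this CSV; the rows below are copied verbatim from the file above:

```python
import csv
import io

# thresholds.csv contents, as shown above.
csv_text = """\
category,name,alpha,threshold,f1,precision,recall
0,general,1.0,0.38,0.685090329194269,0.6732974998130495,0.6973036267335329
4,character,1.0,0.51,0.9457540360090104,0.9618129946021976,0.930222529196658
9,rating,1.0,0.24000000000000002,0.828248843557246,0.7895431723985823,0.8709450692041523
"""
thresholds = {row['name']: float(row['threshold'])
              for row in csv.DictReader(io.StringIO(csv_text))}
print(thresholds['general'], thresholds['character'])  # 0.38 0.51
```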
resnet101.dbv4-full/.gitattributes ADDED
@@ -0,0 +1,35 @@
*.7z filter=lfs diff=lfs merge=lfs -text
*.arrow filter=lfs diff=lfs merge=lfs -text
*.bin filter=lfs diff=lfs merge=lfs -text
*.bz2 filter=lfs diff=lfs merge=lfs -text
*.ckpt filter=lfs diff=lfs merge=lfs -text
*.ftz filter=lfs diff=lfs merge=lfs -text
*.gz filter=lfs diff=lfs merge=lfs -text
*.h5 filter=lfs diff=lfs merge=lfs -text
*.joblib filter=lfs diff=lfs merge=lfs -text
*.lfs.* filter=lfs diff=lfs merge=lfs -text
*.mlmodel filter=lfs diff=lfs merge=lfs -text
*.model filter=lfs diff=lfs merge=lfs -text
*.msgpack filter=lfs diff=lfs merge=lfs -text
*.npy filter=lfs diff=lfs merge=lfs -text
*.npz filter=lfs diff=lfs merge=lfs -text
*.onnx filter=lfs diff=lfs merge=lfs -text
*.ot filter=lfs diff=lfs merge=lfs -text
*.parquet filter=lfs diff=lfs merge=lfs -text
*.pb filter=lfs diff=lfs merge=lfs -text
*.pickle filter=lfs diff=lfs merge=lfs -text
*.pkl filter=lfs diff=lfs merge=lfs -text
*.pt filter=lfs diff=lfs merge=lfs -text
*.pth filter=lfs diff=lfs merge=lfs -text
*.rar filter=lfs diff=lfs merge=lfs -text
*.safetensors filter=lfs diff=lfs merge=lfs -text
saved_model/**/* filter=lfs diff=lfs merge=lfs -text
*.tar.* filter=lfs diff=lfs merge=lfs -text
*.tar filter=lfs diff=lfs merge=lfs -text
*.tflite filter=lfs diff=lfs merge=lfs -text
*.tgz filter=lfs diff=lfs merge=lfs -text
*.wasm filter=lfs diff=lfs merge=lfs -text
*.xz filter=lfs diff=lfs merge=lfs -text
*.zip filter=lfs diff=lfs merge=lfs -text
*.zst filter=lfs diff=lfs merge=lfs -text
*tfevents* filter=lfs diff=lfs merge=lfs -text
resnet101.dbv4-full/README.md ADDED
@@ -0,0 +1,187 @@
---
tags:
- image-classification
- timm
- transformers
- animetimm
- dghs-imgutils
library_name: timm
license: gpl-3.0
datasets:
- animetimm/danbooru-wdtagger-v4-w640-ws-full
base_model:
- timm/resnet101.tv_in1k
---

# Anime Tagger resnet101.dbv4-full

## Model Details

- **Model Type:** Multilabel image classification / feature backbone
- **Model Stats:**
  - Params: 68.1M
  - FLOPs / MACs: 46.0G / 22.9G
  - Image size: train = 384 x 384, test = 384 x 384
- **Dataset:** [animetimm/danbooru-wdtagger-v4-w640-ws-full](https://huggingface.co/datasets/animetimm/danbooru-wdtagger-v4-w640-ws-full)
  - Tags Count: 12476
    - General (#0) Tags Count: 9225
    - Character (#4) Tags Count: 3247
    - Rating (#9) Tags Count: 4

## Results

| # | Macro@0.40 (F1/MCC/P/R) | Micro@0.40 (F1/MCC/P/R) | Macro@Best (F1/P/R) |
|:----------:|:-----------------------------:|:-----------------------------:|:---------------------:|
| Validation | 0.436 / 0.448 / 0.535 / 0.395 | 0.622 / 0.622 / 0.672 / 0.578 | --- |
| Test | 0.437 / 0.448 / 0.535 / 0.396 | 0.622 / 0.623 / 0.672 / 0.579 | 0.481 / 0.509 / 0.482 |

* `Macro/Micro@0.40` denotes metrics computed at a fixed threshold of 0.40.
* `Macro@Best` denotes the mean of metrics computed with per-tag thresholds, each chosen to maximize that tag's F1 score.

## Thresholds

| Category | Name | Alpha | Threshold | Micro@Thr (F1/P/R) | Macro@0.40 (F1/P/R) | Macro@Best (F1/P/R) |
|:----------:|:---------:|:-------:|:-----------:|:---------------------:|:---------------------:|:---------------------:|
| 0 | general | 1 | 0.33 | 0.612 / 0.619 / 0.605 | 0.305 / 0.421 / 0.262 | 0.357 / 0.374 / 0.374 |
| 4 | character | 1 | 0.49 | 0.845 / 0.906 / 0.791 | 0.812 / 0.858 / 0.777 | 0.833 / 0.893 / 0.789 |
| 9 | rating | 1 | 0.4 | 0.800 / 0.755 / 0.851 | 0.805 / 0.778 / 0.837 | 0.806 / 0.771 / 0.848 |

* `Micro@Thr` denotes metrics computed at the category-level suggested thresholds listed above.
* `Macro@0.40` denotes metrics computed at a fixed threshold of 0.40.
* `Macro@Best` denotes metrics computed with per-tag thresholds, each chosen to maximize that tag's F1 score.

The tag-level thresholds are available in [selected_tags.csv](https://huggingface.co/animetimm/resnet101.dbv4-full/resolve/main/selected_tags.csv).

+ ## How to Use
56
+
57
+ We provided a sample image for our code samples, you can find it [here](https://huggingface.co/animetimm/resnet101.dbv4-full/blob/main/sample.webp).
58
+
59
+ ### Use TIMM And Torch
60
+
61
+ Install [dghs-imgutils](https://github.com/deepghs/imgutils), [timm](https://github.com/huggingface/pytorch-image-models) and other necessary requirements with the following command
62
+
63
+ ```shell
64
+ pip install 'dghs-imgutils>=0.17.0' torch huggingface_hub timm pillow pandas
65
+ ```
66
+
67
+ After that you can load this model with timm library, and use it for train, validation and test, with the following code
68
+
69
+ ```python
70
+ import json
71
+
72
+ import pandas as pd
73
+ import torch
74
+ from huggingface_hub import hf_hub_download
75
+ from imgutils.data import load_image
76
+ from imgutils.preprocess import create_torchvision_transforms
77
+ from timm import create_model
78
+
79
+ repo_id = 'animetimm/resnet101.dbv4-full'
80
+ model = create_model(f'hf-hub:{repo_id}', pretrained=True)
81
+ model.eval()
82
+
83
+ with open(hf_hub_download(repo_id=repo_id, repo_type='model', filename='preprocess.json'), 'r') as f:
84
+ preprocessor = create_torchvision_transforms(json.load(f)['test'])
85
+ # Compose(
86
+ # PadToSize(size=(512, 512), interpolation=bilinear, background_color=white)
87
+ # Resize(size=384, interpolation=bilinear, max_size=None, antialias=True)
88
+ # CenterCrop(size=[384, 384])
89
+ # MaybeToTensor()
90
+ # Normalize(mean=tensor([0.4850, 0.4560, 0.4060]), std=tensor([0.2290, 0.2240, 0.2250]))
91
+ # )
92
+
93
+ image = load_image('https://huggingface.co/animetimm/resnet101.dbv4-full/resolve/main/sample.webp')
94
+ input_ = preprocessor(image).unsqueeze(0)
95
+ # input_, shape: torch.Size([1, 3, 384, 384]), dtype: torch.float32
96
+ with torch.no_grad():
97
+ output = model(input_)
98
+ prediction = torch.sigmoid(output)[0]
99
+ # output, shape: torch.Size([1, 12476]), dtype: torch.float32
100
+ # prediction, shape: torch.Size([12476]), dtype: torch.float32
101
+
102
+ df_tags = pd.read_csv(
103
+ hf_hub_download(repo_id=repo_id, repo_type='model', filename='selected_tags.csv'),
104
+ keep_default_na=False
105
+ )
106
+ tags = df_tags['name']
107
+ mask = prediction.numpy() >= df_tags['best_threshold']
108
+ print(dict(zip(tags[mask].tolist(), prediction[mask].tolist())))
109
+ # {'general': 0.5100178718566895,
110
+ # 'sensitive': 0.5034157037734985,
111
+ # '1girl': 0.9962267875671387,
112
+ # 'solo': 0.9669082760810852,
113
+ # 'looking_at_viewer': 0.8127952814102173,
114
+ # 'blush': 0.7912614941596985,
115
+ # 'smile': 0.9032713770866394,
116
+ # 'short_hair': 0.7837649583816528,
117
+ # 'shirt': 0.5146411657333374,
118
+ # 'long_sleeves': 0.7224600315093994,
119
+ # 'brown_hair': 0.5260339379310608,
120
+ # 'holding': 0.5752436518669128,
121
+ # 'dress': 0.5642756223678589,
122
+ # 'closed_mouth': 0.4826013743877411,
123
+ # 'purple_eyes': 0.7590888142585754,
124
+ # 'flower': 0.9180877208709717,
125
+ # 'braid': 0.9453270435333252,
126
+ # 'red_hair': 0.8512048721313477,
127
+ # 'blunt_bangs': 0.5289319753646851,
128
+ # 'bob_cut': 0.22592417895793915,
129
+ # 'plant': 0.5463797450065613,
130
+ # 'blue_flower': 0.6992892026901245,
131
+ # 'crown_braid': 0.7925195097923279,
132
+ # 'potted_plant': 0.5136846899986267,
133
+ # 'flower_pot': 0.4357028007507324,
134
+ # 'wiping_tears': 0.3059103488922119}
135
+ ```
136
+ ### Use ONNX Model For Inference
137
+
138
+ Install [dghs-imgutils](https://github.com/deepghs/imgutils) with the following command
139
+
140
+ ```shell
141
+ pip install 'dghs-imgutils>=0.17.0'
142
+ ```
143
+
144
+ Use `multilabel_timm_predict` function with the following code
145
+
146
+ ```python
147
+ from imgutils.generic import multilabel_timm_predict
148
+
149
+ general, character, rating = multilabel_timm_predict(
150
+ 'https://huggingface.co/animetimm/resnet101.dbv4-full/resolve/main/sample.webp',
151
+ repo_id='animetimm/resnet101.dbv4-full',
152
+ fmt=('general', 'character', 'rating'),
153
+ )
154
+
155
+ print(general)
156
+ # {'1girl': 0.9962266683578491,
157
+ # 'solo': 0.96690833568573,
158
+ # 'braid': 0.9453268647193909,
159
+ # 'flower': 0.9180880784988403,
160
+ # 'smile': 0.9032710790634155,
161
+ # 'red_hair': 0.8512046337127686,
162
+ # 'looking_at_viewer': 0.8127949833869934,
163
+ # 'crown_braid': 0.792519211769104,
164
+ # 'blush': 0.7912609577178955,
165
+ # 'short_hair': 0.7837648391723633,
166
+ # 'purple_eyes': 0.7590886354446411,
167
+ # 'long_sleeves': 0.7224597930908203,
168
+ # 'blue_flower': 0.6992897391319275,
169
+ # 'holding': 0.5752434134483337,
170
+ # 'dress': 0.5642745494842529,
171
+ # 'plant': 0.5463811755180359,
172
+ # 'blunt_bangs': 0.5289315581321716,
173
+ # 'brown_hair': 0.5260326862335205,
174
+ # 'shirt': 0.5146413445472717,
175
+ # 'potted_plant': 0.5136858820915222,
176
+ # 'closed_mouth': 0.48260119557380676,
177
+ # 'flower_pot': 0.4357031583786011,
178
+ # 'wiping_tears': 0.30590835213661194,
179
+ # 'bob_cut': 0.22592449188232422}
180
+ print(character)
181
+ # {}
182
+ print(rating)
183
+ # {'general': 0.5100165009498596, 'sensitive': 0.5034170150756836}
184
+ ```
185
+
186
+ For further information, see [documentation of function multilabel_timm_predict](https://dghs-imgutils.deepghs.org/main/api_doc/generic/multilabel_timm.html#multilabel-timm-predict).
187
+
resnet101.dbv4-full/categories.json ADDED
@@ -0,0 +1,14 @@
[
  {"category": 0, "name": "general"},
  {"category": 4, "name": "character"},
  {"category": 9, "name": "rating"}
]
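These category ids are what `multilabel_timm_predict(..., fmt=('general', 'character', 'rating'))` groups the flat tag scores by. A sketch of that grouping, using the mapping above with hypothetical tags and scores:

```python
import json

# categories.json, as shown above.
categories = json.loads('[{"category": 0, "name": "general"}, '
                        '{"category": 4, "name": "character"}, '
                        '{"category": 9, "name": "rating"}]')
id_to_name = {c['category']: c['name'] for c in categories}

# Hypothetical (category_id, score) predictions for a few tags.
predictions = {'1girl': (0, 0.99), 'smile': (0, 0.90), 'sensitive': (9, 0.95)}
grouped = {name: {} for name in id_to_name.values()}
for tag, (cat, score) in predictions.items():
    grouped[id_to_name[cat]][tag] = score
print(grouped['rating'])  # {'sensitive': 0.95}
```

Note that a category with no surviving tags yields an empty dict, matching the `print(character)` output in the README above.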
resnet101.dbv4-full/config.json ADDED
The diff for this file is too large to render. See raw diff
 
resnet101.dbv4-full/meta.json ADDED
The diff for this file is too large to render. See raw diff
 
resnet101.dbv4-full/metrics.json ADDED
@@ -0,0 +1,25 @@
{
  "test": {
    "macro_f1": 0.4368686378002167,
    "macro_mcc": 0.4481600224971771,
    "macro_precision": 0.5345199108123779,
    "macro_recall": 0.3957759737968445,
    "micro_f1": 0.621998131275177,
    "micro_mcc": 0.6228444576263428,
    "micro_precision": 0.6722905039787292,
    "micro_recall": 0.5787064433097839
  },
  "val": {
    "learning_rate": 4.7306720809906175e-06,
    "loss": 0.40296348299079054,
    "macro_f1": 0.4364077150821686,
    "macro_mcc": 0.447799950838089,
    "macro_precision": 0.5347463488578796,
    "macro_recall": 0.3953515291213989,
    "micro_f1": 0.6215192675590515,
    "micro_mcc": 0.6223735809326172,
    "micro_precision": 0.6719217300415039,
    "micro_recall": 0.578150749206543,
    "step": 93
  }
}
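One internal consistency check on these numbers: micro precision and recall are computed from global TP/FP/FN counts, so the micro F1 must equal their harmonic mean. Values copied from the `test` block above:

```python
# Metrics from the "test" split of metrics.json.
micro_precision = 0.6722905039787292
micro_recall = 0.5787064433097839
micro_f1_reported = 0.621998131275177

# F1 is the harmonic mean of precision and recall.
micro_f1 = 2 * micro_precision * micro_recall / (micro_precision + micro_recall)
print(abs(micro_f1 - micro_f1_reported) < 1e-5)  # True
```

The same identity does not hold for the macro numbers, since macro F1 averages per-tag F1 scores rather than combining the averaged precision and recall.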
resnet101.dbv4-full/preprocess.json ADDED
@@ -0,0 +1,95 @@
{
  "pre": [
    {
      "background_color": "white",
      "interpolation": "bilinear",
      "size": [512, 512],
      "type": "pad_to_size"
    }
  ],
  "test": [
    {
      "background_color": "white",
      "interpolation": "bilinear",
      "size": [512, 512],
      "type": "pad_to_size"
    },
    {
      "antialias": true,
      "interpolation": "bilinear",
      "max_size": null,
      "size": 384,
      "type": "resize"
    },
    {
      "size": [384, 384],
      "type": "center_crop"
    },
    {
      "type": "maybe_to_tensor"
    },
    {
      "mean": [0.48500001430511475, 0.4560000002384186, 0.4059999883174896],
      "std": [0.2290000021457672, 0.2240000069141388, 0.22499999403953552],
      "type": "normalize"
    }
  ],
  "val": [
    {
      "background_color": "white",
      "interpolation": "bilinear",
      "size": [512, 512],
      "type": "pad_to_size"
    },
    {
      "antialias": true,
      "interpolation": "bilinear",
      "max_size": null,
      "size": 384,
      "type": "resize"
    },
    {
      "size": [384, 384],
      "type": "center_crop"
    },
    {
      "type": "maybe_to_tensor"
    },
    {
      "mean": [0.48500001430511475, 0.4560000002384186, 0.4059999883174896],
      "std": [0.2290000021457672, 0.2240000069141388, 0.22499999403953552],
      "type": "normalize"
    }
  ]
}
resnet101.dbv4-full/sample.webp ADDED
resnet101.dbv4-full/selected_tags.csv ADDED
The diff for this file is too large to render. See raw diff
 
resnet101.dbv4-full/thresholds.csv ADDED
@@ -0,0 +1,4 @@
category,name,alpha,threshold,f1,precision,recall
0,general,1.0,0.33,0.6117394521110506,0.6189713640648628,0.604674580327153
4,character,1.0,0.49,0.844884846805129,0.9064400543098392,0.7911582800403905
9,rating,1.0,0.4,0.8004352660836954,0.7552656981659241,0.8513513513513513
resnet152.dbv4-full/.gitattributes ADDED
@@ -0,0 +1,35 @@
*.7z filter=lfs diff=lfs merge=lfs -text
*.arrow filter=lfs diff=lfs merge=lfs -text
*.bin filter=lfs diff=lfs merge=lfs -text
*.bz2 filter=lfs diff=lfs merge=lfs -text
*.ckpt filter=lfs diff=lfs merge=lfs -text
*.ftz filter=lfs diff=lfs merge=lfs -text
*.gz filter=lfs diff=lfs merge=lfs -text
*.h5 filter=lfs diff=lfs merge=lfs -text
*.joblib filter=lfs diff=lfs merge=lfs -text
*.lfs.* filter=lfs diff=lfs merge=lfs -text
*.mlmodel filter=lfs diff=lfs merge=lfs -text
*.model filter=lfs diff=lfs merge=lfs -text
*.msgpack filter=lfs diff=lfs merge=lfs -text
*.npy filter=lfs diff=lfs merge=lfs -text
*.npz filter=lfs diff=lfs merge=lfs -text
*.onnx filter=lfs diff=lfs merge=lfs -text
*.ot filter=lfs diff=lfs merge=lfs -text
*.parquet filter=lfs diff=lfs merge=lfs -text
*.pb filter=lfs diff=lfs merge=lfs -text
*.pickle filter=lfs diff=lfs merge=lfs -text
*.pkl filter=lfs diff=lfs merge=lfs -text
*.pt filter=lfs diff=lfs merge=lfs -text
*.pth filter=lfs diff=lfs merge=lfs -text
*.rar filter=lfs diff=lfs merge=lfs -text
*.safetensors filter=lfs diff=lfs merge=lfs -text
saved_model/**/* filter=lfs diff=lfs merge=lfs -text
*.tar.* filter=lfs diff=lfs merge=lfs -text
*.tar filter=lfs diff=lfs merge=lfs -text
*.tflite filter=lfs diff=lfs merge=lfs -text
*.tgz filter=lfs diff=lfs merge=lfs -text
*.wasm filter=lfs diff=lfs merge=lfs -text
*.xz filter=lfs diff=lfs merge=lfs -text
*.zip filter=lfs diff=lfs merge=lfs -text
*.zst filter=lfs diff=lfs merge=lfs -text
*tfevents* filter=lfs diff=lfs merge=lfs -text
resnet152.dbv4-full/categories.json ADDED
@@ -0,0 +1,14 @@
[
  {"category": 0, "name": "general"},
  {"category": 4, "name": "character"},
  {"category": 9, "name": "rating"}
]
resnet152.dbv4-full/config.json ADDED
The diff for this file is too large to render. See raw diff
 
swinv2_base_window8_256.dbv4a-full/pytorch_model.bin ADDED
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:1dcb080ae4db05b3e3cce367cb81530cc5f3dbe8c1b8308bd2dbb2bc471c844e
size 350402567
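Checkpoints stored via Git LFS show up in the diff as three-line pointer files rather than binary content. Parsing one (the pointer above) takes only a few lines:

```python
# A Git LFS pointer file: space-separated key/value pairs, one per line.
pointer = """version https://git-lfs.github.com/spec/v1
oid sha256:1dcb080ae4db05b3e3cce367cb81530cc5f3dbe8c1b8308bd2dbb2bc471c844e
size 350402567
"""
fields = dict(line.split(' ', 1) for line in pointer.strip().splitlines())
print(fields['size'])  # 350402567 bytes, i.e. roughly a 350 MB checkpoint
```

The `oid` is the SHA-256 of the actual file content, which Git LFS uses to fetch the blob from the LFS store on checkout.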
vit_base_patch16_224.dbv4-full/pytorch_model.bin ADDED
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:69b0244f83fd18c74213cf2761aca60e7906a917843bc4997accf10a6489f0f9
size 383428479