File size: 3,155 Bytes
aa398e2
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
---
license: apache-2.0
pipeline_tag: image-classification
tags:
- multi-label
- anime
- danbooru
- safetensors
---

ℹ️ This is a **safetensors + tag JSON version** of the original model [pixai-labs/pixai-tagger-v0.9](https://huggingface.co/pixai-labs/pixai-tagger-v0.9).  
🔄 Converted by [1038lab](https://huggingface.co/1038lab).  
📦 No changes were made to model weights or logic — only converted `.pth``.safetensors` and bundled with `tags_v0.9_13k.json` for convenience.  
🧠 Full credits and model training go to PixAI Labs.

---

<p>
  <img src="https://huggingface.co/pixai-labs/pixai-tagger-v0.9/resolve/main/banner_09_cropped.jpg" style="height:240px;" />
  <a href="https://huggingface.co/pixai-labs/pixai-tagger-v0.9"><strong>Original Model</strong></a> ·
  <a href="#quickstart"><strong>Quickstart</strong></a> ·
  <a href="#training-notes"><strong>Training Notes</strong></a> ·
  <a href="#credits"><strong>Credits</strong></a>
</p>

---

# PixAI Tagger v0.9

A practical anime **multi-label tagger**. Not trying to win benchmarks; trying to be useful.  
**High recall**, updated **character coverage**, trained on a fresh Danbooru snapshot (2025-01).  
We’ll keep shipping: **v1.0** (with updated tags) is next.

> TL;DR  
>
> - ~**13.5k** Danbooru-style tags (**general**, **character**, **copyright**)  
> - Headline: strong **character** performance; recall-leaning defaults  
> - Built for search, dataset curation, caption assistance, and text-to-image conditioning

---

## What it is (in one breath)

`pixai-tagger-v0.9` is a multi-label image classifier for anime images. It predicts Danbooru-style tags and aims to **find more of the right stuff** (recall) so you can filter later. We continued training the **classification head** of EVA02 (from WD v3) on a newer dataset, and used **embedding-space MixUp** to help calibration.

- **Last trained:** 2025-04  
- **Data snapshot:** Danbooru IDs 1–8,600,750 (2025-01)  
- **Finetuned from:** `SmilingWolf/wd-eva02-large-tagger-v3` (encoder frozen)  
- **License (weights):** Apache 2.0 *(Note: Danbooru content has its own licenses.)*  

---

## Why you might care

- **Newer data.** Catches more recent IPs/characters.  
- **Recall-first defaults.** Good for search and curation; dial thresholds for precision.  
- **Character focus.** We spent time here; it shows up in evals.  
- **Simple to run.** Works as an endpoint or locally; small set of knobs.

---

## Quickstart

**Recommended defaults (balanced):**

- `top_k = 128`  
- `threshold_general = 0.30`  
- `threshold_character = 0.75`  

**Coverage preset (recall-heavier):** `threshold_general = 0.10` (expect more false positives)

### 1) Inference Endpoint

Deploy as an HF Inference Endpoint and test with the following command:

```bash
# Replace with your own endpoint URL
curl "https://YOUR_ENDPOINT_URL.huggingface.cloud" \
  -X POST \
  -H "Accept: application/json" \
  -H "Content-Type: application/json" \
  -d '{
    "inputs": {"url": "https://your.cdn/image.jpg"},
    "parameters": {
      "top_k": 128,
      "threshold_general": 0.10,
      "threshold_character": 0.75
    }
  }'