pixai-tagger / README.md
1038lab's picture
Create README.md
aa398e2 verified
---
license: apache-2.0
pipeline_tag: image-classification
tags:
- multi-label
- anime
- danbooru
- safetensors
---
ℹ️ This is a **safetensors + tag JSON version** of the original model [pixai-labs/pixai-tagger-v0.9](https://huggingface.co/pixai-labs/pixai-tagger-v0.9).
🔄 Converted by [1038lab](https://huggingface.co/1038lab).
📦 No changes were made to model weights or logic — only converted `.pth``.safetensors` and bundled with `tags_v0.9_13k.json` for convenience.
🧠 Full credits and model training go to PixAI Labs.
---
<p>
<img src="https://huggingface.co/pixai-labs/pixai-tagger-v0.9/resolve/main/banner_09_cropped.jpg" style="height:240px;" />
<a href="https://huggingface.co/pixai-labs/pixai-tagger-v0.9"><strong>Original Model</strong></a> ·
<a href="#quickstart"><strong>Quickstart</strong></a> ·
<a href="#training-notes"><strong>Training Notes</strong></a> ·
<a href="#credits"><strong>Credits</strong></a>
</p>
---
# PixAI Tagger v0.9
A practical anime **multi-label tagger**. Not trying to win benchmarks; trying to be useful.
**High recall**, updated **character coverage**, trained on a fresh Danbooru snapshot (2025-01).
We’ll keep shipping: **v1.0** (with updated tags) is next.
> TL;DR
>
> - ~**13.5k** Danbooru-style tags (**general**, **character**, **copyright**)
> - Headline: strong **character** performance; recall-leaning defaults
> - Built for search, dataset curation, caption assistance, and text-to-image conditioning
---
## What it is (in one breath)
`pixai-tagger-v0.9` is a multi-label image classifier for anime images. It predicts Danbooru-style tags and aims to **find more of the right stuff** (recall) so you can filter later. We continued training the **classification head** of EVA02 (from WD v3) on a newer dataset, and used **embedding-space MixUp** to help calibration.
- **Last trained:** 2025-04
- **Data snapshot:** Danbooru IDs 1–8,600,750 (2025-01)
- **Finetuned from:** `SmilingWolf/wd-eva02-large-tagger-v3` (encoder frozen)
- **License (weights):** Apache 2.0 *(Note: Danbooru content has its own licenses.)*
---
## Why you might care
- **Newer data.** Catches more recent IPs/characters.
- **Recall-first defaults.** Good for search and curation; dial thresholds for precision.
- **Character focus.** We spent time here; it shows up in evals.
- **Simple to run.** Works as an endpoint or locally; small set of knobs.
---
## Quickstart
**Recommended defaults (balanced):**
- `top_k = 128`
- `threshold_general = 0.30`
- `threshold_character = 0.75`
**Coverage preset (recall-heavier):** `threshold_general = 0.10` (expect more false positives)
### 1) Inference Endpoint
Deploy as an HF Inference Endpoint and test with the following command:
```bash
# Replace with your own endpoint URL
curl "https://YOUR_ENDPOINT_URL.huggingface.cloud" \
-X POST \
-H "Accept: application/json" \
-H "Content-Type: application/json" \
-d '{
"inputs": {"url": "https://your.cdn/image.jpg"},
"parameters": {
"top_k": 128,
"threshold_general": 0.10,
"threshold_character": 0.75
}
}'