Update README.md
Browse files
README.md
CHANGED
|
@@ -25,9 +25,10 @@ Traditional face models fail where it matters most for AI art workflows:
|
|
| 25 |
|-------------|-------------------|
|
| 26 |
| 🎨 **Domain-locked** | Existing models excel at *either* anime *or* realistic—never both |
|
| 27 |
| 🔞 **NSFW blindness** | Most models trained only on SFW data break on adult content |
|
|
|
|
| 28 |
| 🎲 **Generation artifacts** | Standard datasets don't include diffusion model quirks and failures |
|
| 29 |
|
| 30 |
-
**These models solve all
|
| 31 |
|
| 32 |
---
|
| 33 |
|
|
@@ -35,7 +36,7 @@ Traditional face models fail where it matters most for AI art workflows:
|
|
| 35 |
|
| 36 |
### The Dataset Difference
|
| 37 |
|
| 38 |
-
Built from **
|
| 39 |
|
| 40 |
<table>
|
| 41 |
<tr>
|
|
@@ -79,11 +80,10 @@ These models: Trained on what you actually generate
|
|
| 79 |
|
| 80 |
### Face Detection (YOLOv11-Small)
|
| 81 |
|
| 82 |
-
**Purpose:** Primary face detection with high recall
|
| 83 |
|
| 84 |
**Training Approach:**
|
| 85 |
- After every training run, I ran the model on a new mixed dataset, hardmining failures and improving the dataset until an acceptable performance was reached
|
| 86 |
-
- Used offline custom augmentation on the initial set to complement light yolo training script augmentations
|
| 87 |
- Trained at 640px resolution (inference should use same resolution)
|
| 88 |
|
| 89 |
**Why YOLOv11-Small instead of nano?**
|
|
@@ -92,19 +92,21 @@ More reliable detection across mixed realistic/anime domains with acceptable spe
|
|
| 92 |
---
|
| 93 |
|
| 94 |
|
| 95 |
-
### Segmentation (
|
| 96 |
|
| 97 |
-
**Purpose:** Precise face mask generation
|
| 98 |
|
| 99 |
**Training Approach:**
|
| 100 |
-
-
|
| 101 |
-
-
|
| 102 |
-
|
|
|
|
|
|
|
| 103 |
|
| 104 |
**Features:**
|
| 105 |
-
-
|
| 106 |
-
-
|
| 107 |
-
-
|
| 108 |
|
| 109 |
---
|
| 110 |
|
|
|
|
| 25 |
|-------------|-------------------|
|
| 26 |
| 🎨 **Domain-locked** | Existing models excel at *either* anime *or* realistic—never both |
|
| 27 |
| 🔞 **NSFW blindness** | Most models trained only on SFW data break on adult content |
|
| 28 |
+
| 👁️🗨️ **Detail blindness** | Most models miss anime eyebrows, real eyelashes etc. |
|
| 29 |
| 🎲 **Generation artifacts** | Standard datasets don't include diffusion model quirks and failures |
|
| 30 |
|
| 31 |
+
**These models solve all 4.**
|
| 32 |
|
| 33 |
---
|
| 34 |
|
|
|
|
| 36 |
|
| 37 |
### The Dataset Difference
|
| 38 |
|
| 39 |
+
Built from **14,000+ manually annotated images** across the domains that actually matter for AI generation:
|
| 40 |
|
| 41 |
<table>
|
| 42 |
<tr>
|
|
|
|
| 80 |
|
| 81 |
### Face Detection (YOLOv11-Small)
|
| 82 |
|
| 83 |
+
**Purpose:** Primary face detection with high recall
|
| 84 |
|
| 85 |
**Training Approach:**
|
| 86 |
- After every training run, I ran the model on a new mixed dataset, hardmining failures and improving the dataset until an acceptable performance was reached
|
|
|
|
| 87 |
- Trained at 640px resolution (inference should use same resolution)
|
| 88 |
|
| 89 |
**Why YOLOv11-Small instead of nano?**
|
|
|
|
| 92 |
---
|
| 93 |
|
| 94 |
|
| 95 |
+
### Segmentation (EfficientNet-v2)
|
| 96 |
|
| 97 |
+
**Purpose:** Precise face mask generation
|
| 98 |
|
| 99 |
**Training Approach:**
|
| 100 |
+
- Dataset prepared using the Forbidden Vision yolo model at 512px resolution
|
| 101 |
+
- Iterative hardmine training in multiple phases:
|
| 102 |
+
-- started with 700 samples, trained
|
| 103 |
+
-- run trained model on untrained set of images -> pick failures -> correct -> include in total set and retrain
|
| 104 |
+
-- repeat process until no obvious failures exist -> final set 4k+ images
|
| 105 |
|
| 106 |
**Features:**
|
| 107 |
+
- Detects and includes facial features other models ignore, like protruding anime eybrows, realistic eyelashes sticking out of the face etc.
|
| 108 |
+
- Glasses and similar are treated as part of the face, even if sticking outside the face shape
|
| 109 |
+
- NSFW friendly across both anime, realistic and 3d domains
|
| 110 |
|
| 111 |
---
|
| 112 |
|