Commit 726f767 (verified) by luxdelux7 · 1 Parent(s): 6f22789

Update README.md

  1. README.md +58 -50
README.md CHANGED
@@ -3,102 +3,110 @@ license: apache-2.0
3
  ---
4
 
5
  <div align="center">
6
-
7
  <img src="images/header.webp" width="800px" />
8
 
9
- Custom-trained models for face detection, landmark estimation, and segmentation across realistic, anime, and NSFW content.
10
- Made for soon to be released **Forbidden Vision** ComfyUI custom nodes
11
- <a href="https://github.com/luxdelux7/ComfyUI-Forbidden-Vision">GitHub Repository</a>
12
 
 
13
  <a href="https://ko-fi.com/luxdelux" target="_blank">
14
  <img src="https://ko-fi.com/img/githubbutton_sm.svg" alt="Support me on Ko-fi">
15
  </a>
16
-
17
  </div>
18
 
19
  ---
20
 
21
- ## Models
22
-
23
- | Model | Purpose | Architecture |
24
- |-------|---------|--------------|
25
- | **Face Detection** | Locate faces in images | YOLOv11-small |
26
- | **Landmark Detection** | Facial orientation & alignment | MobileNetV4-Conv-Small |
27
- | **Segmentation** | Precise face masks | GhostNetV2-100 |
28
 
29
- ## What Makes These Different?
30
 
31
- **Trained on what you actually generate.**
 
 
 
32
 
33
- Unlike general-purpose face models trained only on real photographs, these models were specifically trained on:
34
 
35
- - **Diffusion outputs** from CivitAI (realistic and anime styles)
36
- - **Danbooru** anime dataset
37
- - **Real photography** for ground truth accuracy
38
- - **NSFW content** without artificial restrictions
39
-
40
- This mixed training approach ensures reliable performance across:
41
  - ✓ Both realistic and anime art styles
42
  - ✓ Difficult angles, occlusions, and expressions
43
  - ✓ Low-quality generations and artifacts
44
  - ✓ SFW and NSFW content equally
45
  - ✓ Edge cases that break traditional face detectors
46
 
 
 
47
  ## Model Details
48
 
49
  ### Face Detection (YOLOv11-Small)
50
 
51
- Primary face detection and bounding box localization.
52
 
53
- **Training Data:**
54
- - 50k+ images from CivitAI diffusion outputs (SDXL, SD1.5, Pony)
55
- - 30k+ Danbooru anime images
56
- - 20k+ real photographs
57
 
58
- **Features:**
59
- - Handles extreme angles and rotations
60
- - Detects partially occluded faces
61
- - Works with artistic distortions and stylization
62
- - Reliable on both photorealistic and anime styles
63
 
64
  ### Landmark Detection (MobileNetV4-Conv-Small)
65
 
66
- Facial orientation and alignment estimation with 68-point landmark regression.
67
 
68
- **Training Data:**
69
- - 40k+ annotated faces from mixed sources
70
- - Manual annotation for anime/artistic styles
71
- - Augmented with rotation and perspective transforms
72
 
73
- **Features:**
74
- - Fast orientation detection for auto-rotation
75
- - Works on both realistic and anime faces
76
- - Handles partial occlusions and unusual expressions
77
 
78
  ### Segmentation (GhostNetV2-100)
79
 
80
- Precise face mask generation including hair and complex boundaries.
81
 
82
- **Training Data:**
83
- - 35k+ faces with pixel-accurate mask annotations
84
- - Mixed realistic, anime, and artistic styles
85
- - Includes hair, accessories, and complex boundaries
86
 
87
  **Features:**
88
- - Pixel-accurate boundaries including hair
89
- - Handles complex occlusions (hands, objects)
90
- - Works with artistic styles and effects
91
  - Smooth mask edges for seamless blending
92
 
 
 
93
  ## Usage
94
 
95
  These models are automatically downloaded and used by the **Fixer** node in ComfyUI Forbidden Vision.
96
 
97
  ## License
98
 
99
- GNU General Public License v3.0
 
 
 
 
100
 
101
  ## Contact
102
 
103
- - GitHub Issues: [ComfyUI-Forbidden-Vision](https://github.com/luxdelux7/ComfyUI-Forbidden-Vision/issues)
 
104
  - Support: [Ko-fi](https://ko-fi.com/luxdelux)
 
3
  ---
4
 
5
  <div align="center">
 
6
  <img src="images/header.webp" width="800px" />
7
 
8
+ Custom-trained models for face detection, landmark estimation, and segmentation across realistic, anime, and NSFW content.
9
+
10
+ Made for the **Forbidden Vision** ComfyUI custom nodes
11
 
12
+ <a href="https://github.com/luxdelux7/ComfyUI-Forbidden-Vision">GitHub Repository</a>
13
  <a href="https://ko-fi.com/luxdelux" target="_blank">
14
  <img src="https://ko-fi.com/img/githubbutton_sm.svg" alt="Support me on Ko-fi">
15
  </a>
 
16
  </div>
17
 
18
  ---
19
 
20
+ ## Dataset
21
 
22
+ All three models share a core mixed-domain dataset specifically curated for diffusion-generated content:
23
 
24
+ - images from CivitAI diffusion outputs (SDXL, SD1.5, Pony, Illustrious)
25
+ - curated images from Danbooru (mixed anime styles)
26
+ - real photographs from various sources
27
+ - NSFW images from each domain without filtering
28
 
29
+ **Total dataset size:** ~11k manually annotated images
30
 
31
+ This mixed approach ensures the models work reliably across:
32
  - ✓ Both realistic and anime art styles
33
  - ✓ Difficult angles, occlusions, and expressions
34
  - ✓ Low-quality generations and artifacts
35
  - ✓ SFW and NSFW content equally
36
  - ✓ Edge cases that break traditional face detectors
37
 
38
+ ---
39
+
40
  ## Model Details
41
 
42
  ### Face Detection (YOLOv11-Small)
43
 
44
+ **Purpose:** Primary face detection and bounding box localization
45
 
46
+ **Training Approach:**
47
+ - [TRAINING DETAILS PLACEHOLDER]
48
+ - Trained at 640px resolution (inference should use same resolution)
49
+ - [AUGMENTATION DETAILS PLACEHOLDER]
50
 
51
+ **Why YOLOv11-Small instead of nano?**
52
+ More reliable detection across mixed realistic/anime domains with acceptable speed tradeoff.
53
+
54
+ ---
 
55
 
56
  ### Landmark Detection (MobileNetV4-Conv-Small)
57
 
58
+ **Purpose:** Eyes and mouth landmark detection for face alignment
59
 
60
+ **Training Approach:**
61
+ - [TRAINING DETAILS PLACEHOLDER]
62
+ - Focused on eyes and mouth keypoints only
63
+ - [AUGMENTATION DETAILS PLACEHOLDER]
64
 
65
+ **Note:** The auto-rotation feature in the Fixer node is handled by post-processing scripts, not the model itself.
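
The note above says auto-rotation happens in post-processing rather than in the landmark model, but the scripts themselves are not shown here. As a rough sketch of what such a step might look like, assuming the model yields (x, y) pixel coordinates for the two eye centers, the in-plane roll angle can be estimated from the line between the eyes. The function name and input format are hypothetical and not taken from the Fixer node.

```python
import math

def roll_angle_degrees(left_eye, right_eye):
    """Estimate in-plane face rotation from two eye centers.

    Hypothetical helper: `left_eye` and `right_eye` are (x, y) pixel
    coordinates, where `left_eye` is the eye that appears on the left
    side of the image. Image y grows downward, so a positive result means
    the right-hand eye sits lower than the left-hand one.
    """
    dx = right_eye[0] - left_eye[0]
    dy = right_eye[1] - left_eye[1]
    return math.degrees(math.atan2(dy, dx))

# Level eyes give an angle of zero.
angle = roll_angle_degrees((100.0, 120.0), (160.0, 120.0))
```

Rotating the crop by the opposite of this angle would level the eyes. Whether the Fixer node uses this exact approach is not stated in the README.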
66
+
67
+ ---
 
68
 
69
  ### Segmentation (GhostNetV2-100)
70
 
71
+ **Purpose:** Precise face mask generation including hair and complex boundaries
72
 
73
+ **Training Approach:**
74
+ - [TRAINING DETAILS PLACEHOLDER]
75
+ - Pixel-accurate mask annotations
76
+ - [AUGMENTATION DETAILS PLACEHOLDER]
77
 
78
  **Features:**
79
+ - Includes hair, face, and neck regions
80
+ - Handles complex occlusions (hands, objects, accessories)
 
81
  - Smooth mask edges for seamless blending
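
To illustrate why smooth mask edges matter for blending, here is a minimal, dependency-free sketch of per-pixel alpha compositing with a soft mask. It assumes single-channel images stored as nested lists of floats; the function name is hypothetical, and a real pipeline would more likely use NumPy or torch tensors.

```python
def blend_with_mask(original, fixed, mask):
    """Per-pixel alpha blend: out = mask * fixed + (1 - mask) * original.

    All three arguments are equally sized 2D lists of floats. `mask`
    values lie in [0, 1], where 1 selects the `fixed` pixel.
    """
    return [
        [m * f + (1.0 - m) * o for o, f, m in zip(o_row, f_row, m_row)]
        for o_row, f_row, m_row in zip(original, fixed, mask)
    ]

original = [[0.0, 0.0, 0.0]]
fixed = [[1.0, 1.0, 1.0]]
mask = [[0.0, 0.5, 1.0]]  # soft edge: outside, boundary, inside
blended = blend_with_mask(original, fixed, mask)
```

A mask with intermediate values at the boundary mixes the two images there instead of producing a hard seam.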
82
 
83
+ ---
84
+
85
  ## Usage
86
 
87
  These models are automatically downloaded and used by the **Fixer** node in ComfyUI Forbidden Vision.
88
 
89
+ **Manual Usage:**
90
+
91
+ ```python
92
+ # Face Detection (640px inference)
93
+ from ultralytics import YOLO
94
+ model = YOLO('fv_face_detect_yolo11s.pt')
95
+ results = model(image, imgsz=640)
96
+ ```
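
The snippet above stops at the detection call. As a hedged follow-up, the sketch below shows one way to turn a detected box into a padded crop region that stays inside the image. The helper name and the `pad=0.2` default are illustrative assumptions, not values from the node pack.

```python
def pad_box(box, img_w, img_h, pad=0.2):
    """Grow an (x1, y1, x2, y2) box by `pad` of its size on each side,
    clamped to the image bounds. `pad=0.2` is an illustrative default."""
    x1, y1, x2, y2 = box
    dw = (x2 - x1) * pad
    dh = (y2 - y1) * pad
    return (
        max(0, int(round(x1 - dw))),
        max(0, int(round(y1 - dh))),
        min(img_w, int(round(x2 + dw))),
        min(img_h, int(round(y2 + dh))),
    )

# With Ultralytics, boxes would typically come from something like:
#   for box in results[0].boxes.xyxy.tolist():
#       crop_region = pad_box(box, image_width, image_height)
region = pad_box((100, 100, 200, 200), 640, 640)
```

Padding the box gives downstream steps such as landmark detection and segmentation some context around the face. The right amount depends on the workflow.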
97
+
98
+ ---
99
+
100
  ## License
101
 
102
+ Apache 2.0
103
+
104
+ The ComfyUI node pack is GPL-3.0 due to Ultralytics dependencies, but these model weights are released under Apache 2.0.
105
+
106
+ ---
107
 
108
  ## Contact
109
 
110
+ - GitHub: [ComfyUI-Forbidden-Vision](https://github.com/luxdelux7/ComfyUI-Forbidden-Vision)
111
+ - Issues: [GitHub Issues](https://github.com/luxdelux7/ComfyUI-Forbidden-Vision/issues)
112
  - Support: [Ko-fi](https://ko-fi.com/luxdelux)