VisionHarvester v1 β Identity-Safe Image Style & Pose Extractor
By GritAI Solutions LLC
VisionHarvester v1 is a lightweight prompt-based extraction tool for creators working with Stable Diffusion, ComfyUI, SDXL, LoRA training, and Qwen-VL workflows.
It converts a reference image into clean, modular, identity-safe text blocks you can reuse for character building, style replication, dataset generation, and scene reconstruction β without copying real people.
π What VisionHarvester v1 Does
VisionHarvester is built around five reusable components:
1. Base Identity (Safe & Generic)
A neutral description of the subject that includes:
- General body type
- Hair color and basic hairstyle
- Clothing and fabric behavior
- Broad, non-identifying facial description
No pose, no emotion, no personality.
2. Pose (Geometry Only)
Short, comma-separated fragments describing:
- Limb positions
- Body orientation
- Weight distribution
- Head/hip angles
No outfit, no style, no emotion.
3. Outfit & Materials
Details about:
- Clothing type and cut
- Colors
- Fabric texture and behavior (matte, glossy, stretchy)
- Accessories
4. Camera & Lighting
Information about:
- Framing (close-up, half body, full body)
- Camera angle / lens feel
- Lighting direction and softness
- Major shadows and highlights
5. Style Tags
Reusable tags such as:
- studio fitness look
- clean background
- soft cinematic lighting
- high-resolution texture
These drop straight into Stable Diffusion prompts.
π Included in This Repository
- README.md β this documentation
- LICENSE β MIT license
- isionharvester_v1_extractor.prompt.txt β main extraction prompt
- VisionHarvester-PoseStyleExtractor.json β ComfyUI workflow (optional)
- examples/ β sample images and their extracted outputs
Example files:
- examples/example_01.png
- examples/example_01_output.txt
- examples/example_02.png
- examples/example_02_output.txt
- examples/example_03.png
- examples/example_03_output.txt
π© Main Extraction Prompt (v1)
This is the core VisionHarvester v1 prompt shipped in isionharvester_v1_extractor.prompt.txt:
\
Extract a clean, neutral description of the woman in the image.
Keep it simple: β’ No pose or body positioning β’ No emotions or personality β’ No unique facial identifiers β’ No NSFW content β’ Do describe hair, body type (general), clothing, colors, fabrics, and broad facial features
Output 2β4 sentences that would work as a Stable Diffusion base identity block. \\
Use this in:
- Qwen-VL custom prompt
- Any Vision-LLM
- ComfyUI Qwen nodes
- Image-to-text or SD prompt pipelines
πΌ Example Outputs
Example of the kind of identity-safe text this prompt produces:
\
An athletic woman with long dark hair, a medium tan complexion, and soft neutral facial features without distinctive identifiers. She is wearing a fitted black sports bra made from matte stretch fabric and high-waisted leggings. Her appearance is clean, simple, and suitable as a Stable Diffusion base identity.
\\
π Identity & Safety
VisionHarvester v1 is designed to:
- Avoid 1:1 face cloning
- Avoid unique facial markers
- Avoid real-person or celebrity references
- Avoid explicit or NSFW content
It focuses on style, clothing, pose, and scene β not identity.
π§© Use Cases
- Character consistency
- Pose and outfit reuse
- LoRA dataset prep
- Style transfer
- Scene reconstruction
- Visual prompt creation
- Multi-lane ComfyUI pipelines
π§± Author
GritAI Solutions LLC
Robert "BonusLockSmith" Lucyk
Lawton, Oklahoma
MIT licensed. Free for personal and commercial use.