# KB: Cover Art Generator Agent (DALL-E 3)
The **Cover Art Generator Agent** is a specialized multimodal component that turns a book's narrative concepts into cover artwork.
## Core Responsibilities
1. **Concept Analysis**: Analyzes manuscript themes, genre, and target audience to determine visual direction.
2. **Prompt Engineering**: Translates abstract concepts into detailed visual descriptions optimized for DALL-E 3.
3. **Iteration & Refinement**: Generates multiple variations and refines them based on user feedback or sentiment analysis data.
4. **Branding Consistency**: Ensures font styles (via prompt descriptions) and color palettes remain consistent across a book series.
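
One way to approach the branding-consistency responsibility is to keep a single style description per series and append it to every cover prompt. The sketch below is illustrative only: `SeriesStyle` and `apply_series_style` are hypothetical names that do not come from the agent's codebase, and the wording of the style suffix is an assumption.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class SeriesStyle:
    """Hypothetical container for the visual identity shared by a book series."""
    palette: str      # e.g. "deep teal and burnt orange"
    typography: str   # e.g. "bold condensed sans-serif title lettering"


def apply_series_style(base_prompt: str, style: SeriesStyle) -> str:
    """Append the same style suffix to a prompt so every volume shares it."""
    suffix = (
        f" Color palette: {style.palette}."
        f" Title lettering: {style.typography}."
    )
    return base_prompt.rstrip() + suffix


# Usage: two volumes get different scenes but an identical style suffix.
style = SeriesStyle("deep teal and burnt orange", "bold condensed sans-serif")
volume_1 = apply_series_style("A lighthouse in a storm.", style)
volume_2 = apply_series_style("A ship trapped in ice.", style)
```

Because the style lives only in the prompt text, this encourages consistency but cannot guarantee it; generated covers still need a visual check.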
## Technical Workflow
1. **Input**: Receives a `book_title`, `sub_title`, and `short_summary`.
2. **Synthesis**: Uses a high-reasoning model (Llama-3 or GPT-4o) to generate a "Visual Strategy".
3. **Generation**: Executes a call to the `openai.images.generate` API using the `dall-e-3` model.
4. **Output**: Returns the URL of the generated image and stores the generation prompt in the project metadata so the result can be traced back to its prompt and regenerated later.
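
The four steps above can be sketched roughly as follows. This is a minimal sketch under stated assumptions, not the agent's actual implementation: `CoverRequest`, `build_prompt`, `generate_cover`, and the metadata keys are hypothetical, the visual strategy is passed in as plain text rather than produced by a model call, and the portrait size and `hd` quality are illustrative choices. The `client` argument is expected to be an `openai.OpenAI()` instance, whose `images.generate` method accepts the parameters shown.

```python
from dataclasses import dataclass


@dataclass
class CoverRequest:
    """The three inputs named in step 1."""
    book_title: str
    sub_title: str
    short_summary: str


def build_prompt(req: CoverRequest, visual_strategy: str) -> str:
    """Merge the inputs and the visual strategy into one image prompt."""
    return (
        f"Book cover for '{req.book_title}: {req.sub_title}'. "
        f"Story: {req.short_summary} "
        f"Visual strategy: {visual_strategy}"
    )


def generate_cover(client, req: CoverRequest, visual_strategy: str,
                   metadata: dict) -> str:
    """Call the image API and record the prompt in the project metadata."""
    prompt = build_prompt(req, visual_strategy)
    response = client.images.generate(
        model="dall-e-3",
        prompt=prompt,
        size="1024x1792",  # portrait orientation (assumed choice)
        quality="hd",      # assumed choice; "standard" is the alternative
        n=1,               # dall-e-3 returns one image per request
    )
    image = response.data[0]
    # Keep both the submitted prompt and the model's rewritten version.
    metadata["cover_prompt"] = prompt
    metadata["cover_revised_prompt"] = image.revised_prompt
    return image.url
```

Taking `client` as a parameter keeps the function testable with a stub and leaves API key handling to the caller. Since DALL-E 3 may rewrite the prompt before generating, storing `revised_prompt` alongside the original makes it easier to see what was actually rendered.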
## Integration
- **Hugging Face**: Managed via the EbookBuilder Studio UI.
- **OpenAI**: Requires `OPENAI_API_KEY` for DALL-E 3 access.
- **Meta-Orchestrator**: Can be triggered automatically as part of the "Publishing" phase.
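
Since generation depends on `OPENAI_API_KEY`, an automatically triggered run may want to fail early with a clear message when the key is missing. The helper below is a hypothetical sketch; `require_openai_key` is not part of the agent or of the OpenAI SDK.

```python
import os


def require_openai_key(env=None) -> str:
    """Return the API key, or raise if OPENAI_API_KEY is unset or blank."""
    env = os.environ if env is None else env
    key = env.get("OPENAI_API_KEY", "").strip()
    if not key:
        raise RuntimeError(
            "OPENAI_API_KEY is not set; DALL-E 3 generation cannot run."
        )
    return key
```

Running this check before the generation step lets an orchestrated pipeline report a configuration problem instead of a failed API call.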