File size: 4,976 Bytes

0997c23

# Gemini Util - Configurable Options Guide

## Overview

The Gemini Describe functionality has been enhanced with configurable options through two nodes:

1. **Gemini Util - Options**: Configuration node that defines how descriptions should be generated
2. **Gemini Util - Media Describe**: Processing node that accepts media and optional configuration

## New Gemini Util - Options Node

### Purpose

This node provides granular control over description generation by separating configuration from media processing. Connect this node's output to the `gemini_options` input of the Media Describe node.

### Configuration Options

#### API & Model Settings

-   **Gemini API Key**: Your Google Gemini API key
-   **Gemini Model**: Choose from available models (2.5-flash, 2.5-flash-lite, 2.5-pro)
-   **Model Type**: Text2Image or ImageEdit workflow type

#### Description Control (New Boolean Options)

-   **Describe Clothing?** [Yes/No]: Include detailed clothing and accessory descriptions
-   **Describe Hair Style?** [Yes/No]: Include hair texture and motion (but not color/length)
-   **Describe Bokeh?** [Yes/No]: Allow depth of field effects and blur descriptions
-   **Replace Action with Twerking?** [Yes/No]: Replace video movement description with twerking content

#### Text Options

-   **Prefix Text**: Text to prepend to generated descriptions

## Updated Media Describe Node

### Changes

-   **Removed**: API key, model selection, description mode combo box, prefix text
-   **Added**: Optional `gemini_options` input that accepts configuration from Options node
-   **Maintained**: All media handling capabilities (upload, random selection, image/video support)

### Backward Compatibility

When no Options node is connected, the Media Describe node uses these defaults:

-   Describe Clothing: No
-   Describe Hair Style: Yes
-   Describe Bokeh: Yes
-   Replace Action with Twerking: No
-   API Key: Default development key
-   Model: gemini-2.5-flash

## Usage Examples

### Basic Usage

1. Add "Gemini Util - Options" node to workflow
2. Configure desired settings
3. Add "Gemini Util - Media Describe" node
4. Connect Options node output to Media Describe `gemini_options` input
5. Provide media input (image tensor or upload)

### Option Combinations

#### For Minimal Descriptions (No Clothing, No Hair, No Bokeh)

-   Clean, simple descriptions focusing on core elements
-   Results in 3-paragraph structure: Subject, Cinematic, Style

#### For Detailed Fashion Analysis (Clothing + Hair)

-   Comprehensive descriptions including garments and hairstyles
-   Results in 4-paragraph structure when clothing enabled

#### For Sharp Focus Photography (No Bokeh)

-   Explicitly prevents depth-of-field language
-   Useful for product photography or architectural scenes

### System Prompt Improvements

#### Enhanced Decisiveness

All prompts now include instructions to avoid uncertain language:

-   ❌ "She appears to be wearing either lace tights or leggings"
-   ✅ "She wears black lace tights"

#### Hair Style Option

New granular control over hair descriptions:

-   When enabled: Includes texture and movement
-   When disabled: Completely omits hair references
-   Always excludes: Color and length (as before)

#### Dynamic Paragraph Structure

-   3 paragraphs: Subject, Cinematic, Style (no clothing)
-   4 paragraphs: Subject, Cinematic, Style, Clothing (with clothing)
-   5-6 paragraphs for video: Adds Scene and Movement sections

## Migration Guide

### For Existing Workflows

Existing Media Describe nodes will continue working with default settings. To use new features:

1. Add Options node to workflow
2. Configure desired settings
3. Connect to Media Describe node
4. Remove any hardcoded API keys from Media Describe node

### Option Mapping

Old combo box "Description Mode" maps to new options as follows:

-   "Describe without clothing" → Clothing: No, Hair: Yes, Bokeh: Yes
-   "Describe with clothing" → Clothing: Yes, Hair: Yes, Bokeh: Yes
-   "Describe without clothing (No bokeh)" → Clothing: No, Hair: Yes, Bokeh: No
-   "Describe with clothing (No bokeh)" → Clothing: Yes, Hair: Yes, Bokeh: No

## Benefits

1. **Modularity**: Configure once, use with multiple media nodes
2. **Flexibility**: Mix and match options as needed
3. **Clarity**: Each option has a clear purpose
4. **Extensibility**: Easy to add new options in the future
5. **Precision**: More deterministic descriptions without uncertainty

## Troubleshooting

### No Description Generated

-   Verify Options node is connected to Media Describe
-   Check API key is valid
-   Ensure media input is provided

### Unexpected Content

-   Review individual option settings
-   Check if hair/clothing/bokeh options match expectations
-   Verify model type (Text2Image vs ImageEdit) is appropriate

### Backward Compatibility Issues

-   Media Describe works standalone with defaults
-   Only connect Options node if you need custom configuration