diffusers-internal-dev
/

gemini-prompt-expander

Model card Files Files and versions

gemini-prompt-expander / README.md

YiYiXu's picture

Update README.md

ace4b52 verified 1 day ago

|

1.17 kB

	# Gemini Prompt Expander

	A custom ModularPipelineBlocks that uses Google's Gemini API to expand short prompts into detailed, vivid image generation prompts.

	## Requirement

	Install the Google Generative AI package:
	```bash
	pip install google-generativeai
	```

	## Setup

	Get your Gemini API key from [Google AI Studio](https://aistudio.google.com/apikey) and set it as an environment variable:
	```bash
	export GOOGLE_API_KEY="your-api-key-here"
	```

	## Usage
	```python
	from diffusers.modular_pipelines import ModularPipelineBlocks

	gemini_block = ModularPipelineBlocks.from_pretrained(
	"diffusers-internal-dev/gemini-prompt-expander",
	trust_remote_code=True,
	)
	gemini = gemini_block.init_pipeline()
	output = gemini(prompt="a dog sitting by the river, watching the sunset")
	print(f"{output.values['prompt']=}")
	```

	## Use in Mellon

	This block includes a `mellon_pipeline_config.json` for use with [Mellon](https://github.com/cubiq/Mellon):

	1. Drag a Dynamic Block Node from the ModularDiffusers section
	2. Enter `diffusers-internal-dev/gemini-prompt-expander` as the repo_id
	3. The node will transform to show the prompt input and expanded prompt output