Generate stable toy brick structures from text prompts.
Upgraded to v1.0!
Generate new person images with swapped clothes or poses
Transcribe audio files into text