Generate stable toy brick structures from text prompts.
Upgraded to v1.0!
Generate images of people in new clothes or poses
Transcribe audio files or YouTube videos into text