Generate images from text prompts
The agent using over 9000 vision models from the HF Hub.
Large Animatable Human Model