| license: creativeml-openrail-m | |
| base_model: CompVis/stable-diffusion-v1-4 | |
| training_prompt: A person of East Asian ethnicity standing in a well-lit office environment with large windows, speaking directly to the camera, wearing a white shirt, and conveying a calm and professional demeanor. The background is slightly blurred, showcasing greenery and modern office decor, with minimal distractions | |
| tags: | |
| - stable-diffusion | |
| - stable-diffusion-diffusers | |
| - text-to-image | |
| - diffusers | |
| - text-to-video | |
| - tune-a-video | |
| inference: false | |
| # Tune-A-Video - talking-man | |
| ## Model description | |
| - Base model: [CompVis/stable-diffusion-v1-4](https://huggingface.co/CompVis/stable-diffusion-v1-4) | |
| - Training prompt: A person of East Asian ethnicity standing in a well-lit office environment with large windows, speaking directly to the camera, wearing a white shirt, and conveying a calm and professional demeanor. The background is slightly blurred, showcasing greenery and modern office decor, with minimal distractions | |
| ## Related papers: | |
| - [Tune-A-Video](https://arxiv.org/abs/2212.11565): One-Shot Tuning of Image Diffusion Models for Text-to-Video Generation | |
| - [Stable-Diffusion](https://arxiv.org/abs/2112.10752): High-Resolution Image Synthesis with Latent Diffusion Models | |