| license: apache-2.0 | |
| tags: | |
| - text-to-image | |
| # AuraFlow v0.3 | |
|  | |
| AuraFlow v0.3 is the fully open-sourced flow-based text-to-image generation model. The model was trained with more compute compared to the previous version, [AuraFlow-v0.2](https://huggingface.co/fal/AuraFlow-v0.2). | |
| Compared to AuraFlow-v0.2, the model is fine-tuned on more aesthetic datasets and now supports various aspect ratio, (now width and height up to 1536 pixels). | |
| ## Usage | |
| ```bash | |
| $ pip install transformers accelerate protobuf sentencepiece | |
| $ pip install git+https://github.com/huggingface/diffusers.git | |
| ``` | |
| ```python | |
| from diffusers import AuraFlowPipeline | |
| import torch | |
| pipeline = AuraFlowPipeline.from_pretrained( | |
| "terminusresearch/auraflow-v0.3", | |
| torch_dtype=torch.float16, | |
| variant="fp16", | |
| ).to("cuda") | |
| image = pipeline( | |
| prompt="rempage of the iguana character riding F1, fast and furious, cinematic movie poster", | |
| width=1536, | |
| height=768, | |
| num_inference_steps=50, | |
| generator=torch.Generator().manual_seed(1), | |
| guidance_scale=3.5, | |
| ).images[0] | |
| image.save("output.png") | |
| ``` |