Instructions to use aoiandroid/SoulX-FlashHead-1_3B with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Diffusers
How to use aoiandroid/SoulX-FlashHead-1_3B with Diffusers:
pip install -U diffusers transformers accelerate
import torch from diffusers import DiffusionPipeline from diffusers.utils import load_image, export_to_video # switch to "mps" for apple devices pipe = DiffusionPipeline.from_pretrained("aoiandroid/SoulX-FlashHead-1_3B", dtype=torch.bfloat16, device_map="cuda") pipe.to("cuda") prompt = "A man with short gray hair plays a red electric guitar." image = load_image( "https://huggingface.co/datasets/huggingface/documentation-images/resolve/main/diffusers/guitar-man.png" ) output = pipe(image=image, prompt=prompt).frames[0] export_to_video(output, "output.mp4") - Notebooks
- Google Colab
- Kaggle
| { | |
| "_class_name": "WanModelAudioProject", | |
| "_diffusers_version": "0.36.0", | |
| "dim": 1536, | |
| "eps": 1e-06, | |
| "ffn_dim": 8960, | |
| "freq_dim": 256, | |
| "has_image_input": false, | |
| "in_dim": 32, | |
| "num_heads": 12, | |
| "num_layers": 30, | |
| "out_dim": 16, | |
| "patch_size": [ | |
| 1, | |
| 2, | |
| 2 | |
| ], | |
| "text_dim": 4096, | |
| "vae_stride": [ | |
| 4, | |
| 8, | |
| 8 | |
| ] | |
| } | |