Instructions to use jeffreyCheung/audioldm2-large with the Diffusers library.
How to use jeffreyCheung/audioldm2-large with Diffusers:
pip install -U diffusers transformers accelerate
import torch
from diffusers import DiffusionPipeline

# switch to "mps" for Apple devices
pipe = DiffusionPipeline.from_pretrained("jeffreyCheung/audioldm2-large", dtype=torch.bfloat16, device_map="cuda")

# AudioLDM 2 is a text-to-audio model, so the prompt describes a sound
# and the result is read from `.audios`, not `.images`
prompt = "The sound of a hammer hitting a wooden surface"
audio = pipe(prompt).audios[0]
File size: 541 Bytes
{
"chunk_length_s": 10,
"feature_extractor_type": "ClapFeatureExtractor",
"feature_size": 64,
"fft_window_size": 1024,
"frequency_max": 14000,
"frequency_min": 50,
"hop_length": 480,
"max_length_s": 10,
"n_fft": 1024,
"nb_frequency_bins": 513,
"nb_max_frames": 1000,
"nb_max_samples": 480000,
"padding": "repeatpad",
"padding_side": "right",
"padding_value": 0.0,
"processor_class": "ClapProcessor",
"return_attention_mask": false,
"sampling_rate": 48000,
"top_db": null,
"truncation": "rand_trunc"
}
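Several of the values above are related by simple arithmetic. The sketch below is not part of the repository; it copies a subset of the fields and checks them against each other, on the assumption that `nb_max_samples` is the clip length in samples, `nb_frequency_bins` is the one-sided FFT bin count, and `nb_max_frames` counts hops over the clip.

```python
import json

# Subset of the config shown above, copied verbatim.
config = json.loads("""
{
  "hop_length": 480,
  "max_length_s": 10,
  "n_fft": 1024,
  "nb_frequency_bins": 513,
  "nb_max_frames": 1000,
  "nb_max_samples": 480000,
  "sampling_rate": 48000
}
""")

# Longest clip in samples: seconds * samples per second.
max_samples = config["max_length_s"] * config["sampling_rate"]

# A real-valued FFT of size n_fft has n_fft / 2 + 1 non-redundant bins.
frequency_bins = config["n_fft"] // 2 + 1

# Number of hops that fit in the longest clip.
max_frames = max_samples // config["hop_length"]

# Time step between consecutive spectrogram frames, in milliseconds.
hop_ms = 1000 * config["hop_length"] / config["sampling_rate"]

print(max_samples, frequency_bins, max_frames, hop_ms)
```

Under these assumptions the three derived values match the stored `nb_max_samples`, `nb_frequency_bins`, and `nb_max_frames`, and each frame advances by 10 ms.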
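The `"padding": "repeatpad"` and `"truncation": "rand_trunc"` settings control how a clip is fitted to `nb_max_samples`. The following is a plain-Python sketch of how those two modes are commonly understood to behave (short clips are tiled a whole number of times and then padded with `padding_value` on the right; long clips are cropped at a random offset). It is an illustration, not the library's code, and `fit_to_length` is a hypothetical helper name.

```python
import random


def fit_to_length(samples, max_len, pad_value=0.0, rng=None):
    """Fit a list of samples to exactly max_len items (illustrative only)."""
    rng = rng or random.Random()
    n = len(samples)

    if n > max_len:
        # "rand_trunc": keep a random contiguous window of max_len samples.
        start = rng.randint(0, n - max_len)  # both ends inclusive
        return samples[start:start + max_len]

    if n == 0:
        return [pad_value] * max_len

    # "repeatpad": repeat the clip as many whole times as fit...
    repeated = samples * (max_len // n)
    # ...then fill the remainder on the right with the padding value.
    return repeated + [pad_value] * (max_len - len(repeated))


short = fit_to_length([1.0, 2.0, 3.0], 8)
print(short)  # [1.0, 2.0, 3.0, 1.0, 2.0, 3.0, 0.0, 0.0]

long_clip = list(range(10))
window = fit_to_length(long_clip, 4, rng=random.Random(0))
print(len(window))  # 4
```

With the config above, `max_len` would be 480000 and `pad_value` would be 0.0.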