Text-to-Audio
Diffusers
Safetensors
StableAudioPipeline
audio
sound-effects
ambient
diffusion
stable-audio
maestraea
Instructions to use AEmotionStudio/stable-audio-open-models with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Diffusers
How to use AEmotionStudio/stable-audio-open-models with Diffusers:
pip install -U diffusers transformers accelerate
import torch from diffusers import DiffusionPipeline # switch to "mps" for apple devices pipe = DiffusionPipeline.from_pretrained("AEmotionStudio/stable-audio-open-models", dtype=torch.bfloat16, device_map="cuda") prompt = "Astronaut in a jungle, cold color palette, muted colors, detailed, 8k" image = pipe(prompt).images[0] - Notebooks
- Google Colab
- Kaggle
Mirror transformer/config.json from stabilityai/stable-audio-open-1.0
Browse files- transformer/config.json +15 -0
transformer/config.json
ADDED
|
@@ -0,0 +1,15 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
{
|
| 2 |
+
"_class_name": "StableAudioDiTModel",
|
| 3 |
+
"_diffusers_version": "0.30.0.dev0",
|
| 4 |
+
"attention_head_dim": 64,
|
| 5 |
+
"cross_attention_dim": 768,
|
| 6 |
+
"cross_attention_input_dim": 768,
|
| 7 |
+
"global_states_input_dim": 1536,
|
| 8 |
+
"in_channels": 64,
|
| 9 |
+
"num_attention_heads": 24,
|
| 10 |
+
"num_key_value_attention_heads": 12,
|
| 11 |
+
"num_layers": 24,
|
| 12 |
+
"out_channels": 64,
|
| 13 |
+
"sample_size": 1024.0,
|
| 14 |
+
"time_proj_dim": 256
|
| 15 |
+
}
|