File size: 550 Bytes
3785ec6
{
  "_class_name": "LongCatAudioDiTVae",
  "_diffusers_version": "0.38.0.dev0",
  "act_fn": null,
  "c_mults": [1, 2, 4, 8, 16],
  "channels": 128,
  "downsample_shortcut": "averaging",
  "downsampling_ratio": 2048,
  "encoder_latent_dim": 128,
  "final_tanh": false,
  "in_channels": 1,
  "in_shortcut": "duplicating",
  "latent_dim": 64,
  "out_shortcut": "averaging",
  "sample_rate": 24000,
  "scale": 0.71,
  "strides": [2, 4, 4, 8, 8],
  "upsample_shortcut": "duplicating",
  "use_snake": true
}
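The numeric fields in this config are related to one another, and a short sanity check makes those relationships explicit. The following is a minimal sketch that uses only the values shown above. It reads `c_mults` as per-stage multipliers of `channels`, and rounds the latent length up with `ceil`; both are assumptions about how `LongCatAudioDiTVae` uses these fields, not confirmed behavior. The helper `latent_frames` is hypothetical and exists only for illustration.

```python
import json
import math

# Subset of the config above, copied verbatim.
CONFIG_JSON = """
{
  "c_mults": [1, 2, 4, 8, 16],
  "channels": 128,
  "downsampling_ratio": 2048,
  "encoder_latent_dim": 128,
  "latent_dim": 64,
  "sample_rate": 24000,
  "strides": [2, 4, 4, 8, 8]
}
"""

config = json.loads(CONFIG_JSON)

# The product of the per-stage strides should equal the overall ratio.
total_stride = math.prod(config["strides"])

# Number of latent frames produced per second of 24 kHz audio.
latent_rate_hz = config["sample_rate"] / total_stride

# Assumption: each stage width is channels * c_mults[i].
stage_widths = [config["channels"] * m for m in config["c_mults"]]


def latent_frames(num_samples: int, ratio: int = total_stride) -> int:
    """Hypothetical helper: latent length for a waveform of num_samples.

    Assumes the input is padded up to a multiple of the ratio; the real
    model may truncate instead.
    """
    return math.ceil(num_samples / ratio)


if __name__ == "__main__":
    print("total stride:", total_stride)
    print("latent rate (Hz):", latent_rate_hz)
    print("stage widths:", stage_widths)
    print("frames for 10 s:", latent_frames(10 * config["sample_rate"]))
```

With these values the stride product matches `downsampling_ratio` (2048), so one latent frame covers 2048 samples, or roughly 85 ms of audio at 24 kHz. Note also that `encoder_latent_dim` is exactly twice `latent_dim`, which would be consistent with an encoder that emits two values per latent channel (for example a mean and a scale), though the config alone does not confirm that.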