Buckets:

hf-doc-build/doc-dev / diffusers /pr_11739 /en /api /models /autoencoderkl_audio_ltx_2.md
rtrm's picture
|
download
raw
1.19 kB

AutoencoderKLLTX2Audio

The 3D variational autoencoder (VAE) model with KL loss used in LTX-2 was introduced by Lightricks. This is for encoding and decoding audio latent representations.

The model can be loaded with the following code snippet.

from diffusers import AutoencoderKLLTX2Audio

vae = AutoencoderKLLTX2Audio.from_pretrained("Lightricks/LTX-2", subfolder="vae", torch_dtype=torch.float32).to("cuda")

AutoencoderKLLTX2Audio[[diffusers.AutoencoderKLLTX2Audio]]

diffusers.AutoencoderKLLTX2Audio[[diffusers.AutoencoderKLLTX2Audio]]

Source

LTX2 audio VAE for encoding and decoding audio latent representations.

wrapperdiffusers.AutoencoderKLLTX2Audio.encodehttps://github.com/huggingface/diffusers/blob/vr_11739/src/diffusers/utils/accelerate_utils.py#L43[{"name": "*args", "val": ""}, {"name": "**kwargs", "val": ""}]

wrapper[[diffusers.AutoencoderKLLTX2Audio.decode]]

Source

Xet Storage Details

Size:
1.19 kB
·
Xet hash:
1c38c0ff92e8c83d2c903e4e069c5628d957c6515e8b7f3a413a89dd251d36fe

Xet efficiently stores files, intelligently splitting them into unique chunks and accelerating uploads and downloads. More info.