Buckets:

hf-doc-build/doc-dev / diffusers /pr_12652 /en /api /models /autoencoderkl_audio_ltx_2.md
rtrm's picture
|
download
raw
1.19 kB

AutoencoderKLLTX2Audio

The 3D variational autoencoder (VAE) model with KL loss used in LTX-2 was introduced by Lightricks. This is for encoding and decoding audio latent representations.

The model can be loaded with the following code snippet.

from diffusers import AutoencoderKLLTX2Audio

vae = AutoencoderKLLTX2Audio.from_pretrained("Lightricks/LTX-2", subfolder="vae", torch_dtype=torch.float32).to("cuda")

AutoencoderKLLTX2Audio[[diffusers.AutoencoderKLLTX2Audio]]

diffusers.AutoencoderKLLTX2Audio[[diffusers.AutoencoderKLLTX2Audio]]

Source

LTX2 audio VAE for encoding and decoding audio latent representations.

wrapperdiffusers.AutoencoderKLLTX2Audio.encodehttps://github.com/huggingface/diffusers/blob/vr_12652/src/diffusers/utils/accelerate_utils.py#L43[{"name": "*args", "val": ""}, {"name": "**kwargs", "val": ""}]

wrapper[[diffusers.AutoencoderKLLTX2Audio.decode]]

Source

Xet Storage Details

Size:
1.19 kB
·
Xet hash:
98458fecdf7d5a818970bd8be9b048d447ca7cbfe448471eb2248ca58723e37d

Xet efficiently stores files, intelligently splitting them into unique chunks and accelerating uploads and downloads. More info.