Buckets:

hf-doc-build/doc / diffusers /main /en /api /models /autoencoderkl_audio_ltx_2.md
|
download
raw
1.81 kB

AutoencoderKLLTX2Audio

The 3D variational autoencoder (VAE) model with KL loss used in LTX-2 was introduced by Lightricks. This is for encoding and decoding audio latent representations.

The model can be loaded with the following code snippet.

from diffusers import AutoencoderKLLTX2Audio

vae = AutoencoderKLLTX2Audio.from_pretrained("Lightricks/LTX-2", subfolder="vae", torch_dtype=torch.float32).to("cuda")

AutoencoderKLLTX2Audio[[diffusers.AutoencoderKLLTX2Audio]]

diffusers.AutoencoderKLLTX2Audio[[diffusers.AutoencoderKLLTX2Audio]]

Source

LTX2 audio VAE for encoding and decoding audio latent representations.

wrapperdiffusers.AutoencoderKLLTX2Audio.encodehttps://github.com/huggingface/diffusers/blob/main/src/diffusers/utils/accelerate_utils.py#L43[{"name": "*args", "val": ""}, {"name": "**kwargs", "val": ""}]

wrapper[[diffusers.AutoencoderKLLTX2Audio.decode]]

Source

forward[[diffusers.AutoencoderKLLTX2Audio.forward]]

Source

Parameters:

sample (torch.Tensor) : Input sample.

sample_posterior (bool, optional, defaults to False) : Whether to sample from the posterior.

return_dict (bool, optional, defaults to True) : Whether or not to return a DecoderOutput instead of a plain tuple.

generator (torch.Generator, optional) : A torch.Generator to make sampling deterministic.

Xet Storage Details

Size:
1.81 kB
·
Xet hash:
28a5ff5539a1c1cf23cdd0b631cf6caa1141269c286f3bbf3c2a5a39b4d7f37f

Xet efficiently stores files, intelligently splitting them into unique chunks and accelerating uploads and downloads. More info.