Buckets:
AutoencoderKLLTX2Audio
The 3D variational autoencoder (VAE) model with KL loss used in LTX-2 was introduced by Lightricks. This is for encoding and decoding audio latent representations.
The model can be loaded with the following code snippet.
from diffusers import AutoencoderKLLTX2Audio
vae = AutoencoderKLLTX2Audio.from_pretrained("Lightricks/LTX-2", subfolder="vae", torch_dtype=torch.float32).to("cuda")
AutoencoderKLLTX2Audio[[diffusers.AutoencoderKLLTX2Audio]]
diffusers.AutoencoderKLLTX2Audio[[diffusers.AutoencoderKLLTX2Audio]]
LTX2 audio VAE for encoding and decoding audio latent representations.
wrapperdiffusers.AutoencoderKLLTX2Audio.encodehttps://github.com/huggingface/diffusers/blob/main/src/diffusers/utils/accelerate_utils.py#L43[{"name": "*args", "val": ""}, {"name": "**kwargs", "val": ""}]
wrapper[[diffusers.AutoencoderKLLTX2Audio.decode]]
forward[[diffusers.AutoencoderKLLTX2Audio.forward]]
Parameters:
sample (torch.Tensor) : Input sample.
sample_posterior (bool, optional, defaults to False) : Whether to sample from the posterior.
return_dict (bool, optional, defaults to True) : Whether or not to return a DecoderOutput instead of a plain tuple.
generator (torch.Generator, optional) : A torch.Generator to make sampling deterministic.
Xet Storage Details
- Size:
- 1.81 kB
- Xet hash:
- 28a5ff5539a1c1cf23cdd0b631cf6caa1141269c286f3bbf3c2a5a39b4d7f37f
Xet efficiently stores files, intelligently splitting them into unique chunks and accelerating uploads and downloads. More info.