Buckets:

hf-doc-build
/

doc-dev

Files

xet

hf-doc-build/doc-dev / diffusers /pr_12968 /en /api /models /autoencoder_kl_kvae_video.md

HuggingFaceDocBuilder

28 days ago

preview code

download

raw

3.07 kB

AutoencoderKLKVAEVideo

The 3D variational autoencoder (VAE) model with KL loss.

The model can be loaded with the following code snippet.

import torch
from diffusers import AutoencoderKLKVAEVideo

vae = AutoencoderKLKVAEVideo.from_pretrained("kandinskylab/KVAE-3D-1.0", subfolder="diffusers", torch_dtype=torch.float16)

AutoencoderKLKVAEVideo[[diffusers.AutoencoderKLKVAEVideo]]

diffusers.AutoencoderKLKVAEVideo[[diffusers.AutoencoderKLKVAEVideo]]

Source

A VAE model with KL loss for encoding videos into latents and decoding latent representations into videos. Used in KVAE.

This model inherits from ModelMixin. Check the superclass documentation for its generic methods implemented for all models (such as downloading or saving).

wrapperdiffusers.AutoencoderKLKVAEVideo.decodehttps://github.com/huggingface/diffusers/blob/vr_12968/src/diffusers/utils/accelerate_utils.py#L43[{"name": "*args", "val": ""}, {"name": "**kwargs", "val": ""}]

Parameters:

ch (int, optional, defaults to 128) : Base channel count.

ch_mult (Tuple[int], optional, defaults to (1, 2, 4, 8)) : Channel multipliers per level.

num_res_blocks (int, optional, defaults to 2) : Number of residual blocks per level.

in_channels (int, optional, defaults to 3) : Number of input channels.

out_ch (int, optional, defaults to 3) : Number of output channels.

z_channels (int, optional, defaults to 16) : Number of latent channels.

temporal_compress_times (int, optional, defaults to 4) : Temporal compression factor.

disable_slicing[[diffusers.AutoencoderKLKVAEVideo.disable_slicing]]

Source

Disable sliced VAE decoding.

enable_slicing[[diffusers.AutoencoderKLKVAEVideo.enable_slicing]]

Source

Enable sliced VAE decoding.

forward[[diffusers.AutoencoderKLKVAEVideo.forward]]

Source

Parameters:

sample (torch.Tensor) : Input sample.

sample_posterior (bool, optional, defaults to False) : Whether to sample from the posterior.

return_dict (bool, optional, defaults to True) : Whether or not to return a DecoderOutput instead of a plain tuple.

generator (torch.Generator, optional) : A torch.Generator to make sampling deterministic.

Returns:

~models.vae.DecoderOutput` or `tuple

If return_dict is True, a ~models.vae.DecoderOutput is returned, otherwise a plain tuple is returned.

Xet Storage Details

Size:: 3.07 kB
Xet hash:: df0b5622476b0a100cb38a18605884e1a5cb42d45c0e9f204cf275540c389d3d

Xet efficiently stores files, intelligently splitting them into unique chunks and accelerating uploads and downloads. More info.