Add pipeline tag
#1 opened by nielsr (HF Staff)
README.md CHANGED
@@ -3,12 +3,13 @@ license: mit
 tags:
 - VAE
 - Video-Generation
+pipeline_tag: image-to-video
 ---
 
 # Reducio-VAE Model Card
 
 <!-- Provide a quick summary of what the model is/does. -->
-This model is a 3D VAE that encodes video into a compact latent space conditioned on a content frame. It compresses a video by a factor of \\(\frac{T}{4}\times\frac{H}{32}\times\frac{W}{32}\\), enabling 4096x downsampling.
+This model is a 3D VAE that encodes video into a compact latent space conditioned on a content frame. It compresses a video by a factor of \\(\\frac{T}{4}\times\frac{H}{32}\times\frac{W}{32}\\), enabling 4096x downsampling.
 It is part of the [Reducio-DiT](https://arxiv.org/abs/2411.13552), which is a video generation method. Codebase available [here](https://github.com/microsoft/Reducio-VAE).
 
 
@@ -18,8 +19,8 @@ It is part of the [Reducio-DiT](https://arxiv.org/abs/2411.13552), which is a vi
 
 <!-- Provide the basic links for the model. -->
 
--
--
+- **Repository:** [GitHub Repository](https://github.com/microsoft/Reducio-VAE)
+- **Paper:** [arXiv](https://arxiv.org/abs/2411.13552)
 
 ## Uses
 
@@ -43,7 +44,7 @@ The model is typically used for supporting training a video diffusion model. Aft
 
 Metrics on 1K Pexels validation set and UCF-101:
 
-|Method|Downsample Factor|\|z\||PSNR |SSIM |LPIPS |rFVD (Pexels)|rFVD (UCF-101)|
+|Method|Downsample Factor|\\|z\\||PSNR |SSIM |LPIPS |rFVD (Pexels)|rFVD (UCF-101)|
 |---------|---------------------|------------------|------------|--------------------|--------------|----------------|------------|
 |SD2.1-VAE|1\*8\*8|4|29.23|0.82|0.09|25.96|21.00|
 |SDXL-VAE|1\*8\*8|16|30.54|0.85|0.08|19.87|23.68|
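
As a quick sanity check on the 4096x figure in the model card summary touched by this diff, the per-axis factors (4 in time, 32 in each spatial dimension) can be multiplied out. The sketch below is only illustrative arithmetic, not code from the Reducio-VAE repository: the function names, the 16-frame 256x256 example clip, and the requirement that dimensions divide evenly are all assumptions made for the example, and latent channel count is ignored.

```python
def latent_shape(frames, height, width, t_factor=4, s_factor=32):
    """Return (T/4, H/32, W/32) for a clip, using the factors quoted in the card.

    Hypothetical helper: assumes every dimension divides evenly, which the
    real model may or may not require.
    """
    if frames % t_factor or height % s_factor or width % s_factor:
        raise ValueError("dimensions must be divisible by the downsampling factors")
    return frames // t_factor, height // s_factor, width // s_factor


def downsample_factor(t_factor=4, s_factor=32):
    """Overall reduction in spatio-temporal positions: 4 * 32 * 32."""
    return t_factor * s_factor * s_factor


# Hypothetical input clip: 16 frames at 256x256.
t, h, w = latent_shape(16, 256, 256)
print((t, h, w))                       # (4, 8, 8)
print(downsample_factor())             # 4096
print((16 * 256 * 256) // (t * h * w)) # 4096, same ratio computed from the shapes
```

The last line recomputes the ratio directly from input and latent sizes, which is a simple way to confirm that the per-axis factors and the headline number agree.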