Update README.md
Browse files
README.md
CHANGED
|
@@ -1,6 +1,5 @@
|
|
| 1 |
---
|
| 2 |
license: mit
|
| 3 |
-
pipeline_tag: image-to-video
|
| 4 |
tags:
|
| 5 |
- VAE
|
| 6 |
- Video-Generation
|
|
@@ -9,7 +8,7 @@ tags:
|
|
| 9 |
# Reducio-VAE Model Card
|
| 10 |
|
| 11 |
<!-- Provide a quick summary of what the model is/does. -->
|
| 12 |
-
This model is a 3D VAE that encodes video into a compact latent space conditioned on a content frame. It compresses a video by a factor of $$\frac{T}{4}\times\frac{H}{32}\times\frac{W}{32}$$, enabling $4096\times$ downsampling.
|
| 13 |
It is part of the [Reducio-DiT](https://arxiv.org/abs/xxxx), which is a video generation method. Codebase available [here](https://github.com/microsoft/Reducio-VAE).
|
| 14 |
|
| 15 |
|
|
|
|
| 1 |
---
|
| 2 |
license: mit
|
|
|
|
| 3 |
tags:
|
| 4 |
- VAE
|
| 5 |
- Video-Generation
|
|
|
|
| 8 |
# Reducio-VAE Model Card
|
| 9 |
|
| 10 |
<!-- Provide a quick summary of what the model is/does. -->
|
| 11 |
+
This model is a 3D VAE that encodes video into a compact latent space conditioned on a content frame. It compresses a video by a factor of \\($$\frac{T}{4}\times\frac{H}{32}\times\frac{W}{32}$$\\), enabling $4096\times$ downsampling.
|
| 12 |
It is part of the [Reducio-DiT](https://arxiv.org/abs/xxxx), which is a video generation method. Codebase available [here](https://github.com/microsoft/Reducio-VAE).
|
| 13 |
|
| 14 |
|