Files changed (1) hide show
  1. README.md +2 -2
README.md CHANGED
@@ -8,13 +8,13 @@
8
 
9
  <!-- Provide a quick summary of what the model is/does. -->
10
  ![row01](output_tile.gif)
11
- Stable Video Diffusion (SVD) Image-to-Video is a diffusion model that takes in a still image as a conditioning frame, and generates a video from it.
12
 
13
  ## Model Details
14
 
15
  ### Model Description
16
 
17
- (SVD) Image-to-Video is a latent diffusion model trained to generate short video clips from an image conditioning.
18
  This model was trained to generate 25 frames at resolution 576x1024 given a context frame of the same size, finetuned from [SVD Image-to-Video [14 frames]](https://huggingface.co/stabilityai/stable-video-diffusion-img2vid).
19
  We also finetune the widely used [f8-decoder](https://huggingface.co/docs/diffusers/api/models/autoencoderkl#loading-from-the-original-format) for temporal consistency.
20
  For convenience, we additionally provide the model with the
 
8
 
9
  <!-- Provide a quick summary of what the model is/does. -->
10
  ![row01](output_tile.gif)
11
+ Stable Video Diffusion (SVD) Image-to-Video is a diffusion model that takes in a still image as a conditioning frame and generates a video from it.
12
 
13
  ## Model Details
14
 
15
  ### Model Description
16
 
17
+ (SVD) Image-to-Video is a latent diffusion model trained to generate short video clips from image conditioning.
18
  This model was trained to generate 25 frames at resolution 576x1024 given a context frame of the same size, finetuned from [SVD Image-to-Video [14 frames]](https://huggingface.co/stabilityai/stable-video-diffusion-img2vid).
19
  We also finetune the widely used [f8-decoder](https://huggingface.co/docs/diffusers/api/models/autoencoderkl#loading-from-the-original-format) for temporal consistency.
20
  For convenience, we additionally provide the model with the