onkarsus13 committed · Commit 1e7e21f · verified · 1 Parent(s): 77f2261

Update README.md

  1. README.md +47 -1
README.md CHANGED
@@ -7,4 +7,50 @@ language:
  - en
  ---
 
- Trained Autoencoder on WAN2.1 for 1080p and 4K videos with language guidance Quntization
+ # Pyramidal Spectrum
+ ### Frequency-based Hierarchically Vector Quantized VAE for Videos
+ **Official Implementation — WACV 2026**
+
+ This repository provides the **official implementation** of the paper:
+
+ **Pyramidal Spectrum: Frequency-based Hierarchically Vector Quantized VAE for Videos**
+ *Accepted at WACV 2026*
+
+ We introduce a **new autoencoder trained on 4K-resolution video data**, featuring a **hierarchical frequency-based vector quantization** method.
+ The model leverages a **pyramidal spectral representation** to produce high-fidelity video reconstructions with an efficient latent structure.
+
+ ---
+
+ ## 📦 Installation
+
+ This implementation requires installing Diffusers from the custom branch:
+
+ ```bash
+ pip install git+https://github.com/Onkarsus13/diffusers@MMVQVae
+ ```
+ ## 🚀 Features
+ - Novel **hierarchical frequency-domain quantization**
+ - Trained on **4K-resolution** video datasets
+ - Multi-level **pyramidal spectral decomposition**
+ - Highly efficient latent video representation
+ - High-quality reconstructions suitable for generative pipelines
+
+ ---
+
+
+ ```
+ @inproceedings{pyramidal_spectrum_wacv2026,
+   title = {Pyramidal Spectrum: Frequency-based Hierarchically Vector Quantized VAE for Videos},
+   author = {Tushar Prakash and Onkar Susladkar and
+             Inderjit Dhillon and Sparsh Mittal},
+   booktitle = {Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision (WACV)},
+   year = {2026}
+ }
+ ```
+
+
+
+
+
+
+
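
The README added in this commit names a hierarchical frequency-based vector quantization over a pyramidal spectral representation, but does not spell out the algorithm. The following is a minimal NumPy sketch of the general idea only, not the paper's method and not the API of the `MMVQVae` branch: a single 2D frame is split into octave-spaced radial frequency bands with FFT masks, and each band is vector-quantized with its own codebook. All function names (`split_bands`, `pyramid_vq`, and the helpers), the 2x2 patch size, the octave band spacing, and the codebook initialization (random patches, no learning) are assumptions made for illustration.

```python
import numpy as np


def split_bands(frame, n_levels):
    """Split a 2D frame into n_levels radial frequency bands that sum to the frame."""
    h, w = frame.shape
    spectrum = np.fft.fftshift(np.fft.fft2(frame))
    fy = np.fft.fftshift(np.fft.fftfreq(h))[:, None]
    fx = np.fft.fftshift(np.fft.fftfreq(w))[None, :]
    radius = np.sqrt(fx ** 2 + fy ** 2)
    r_max = radius.max()
    # Octave-spaced outer radii; the last one equals r_max so every bin is covered.
    outer = [r_max / 2 ** (n_levels - 1 - i) for i in range(n_levels)]
    bands, inner = [], -1.0  # inner < 0 so the DC bin lands in the first band
    for hi in outer:
        mask = (radius > inner) & (radius <= hi)
        bands.append(np.fft.ifft2(np.fft.ifftshift(spectrum * mask)).real)
        inner = hi
    return bands


def to_patches(band, p):
    """Flatten non-overlapping p x p patches into rows (h and w must divide by p)."""
    h, w = band.shape
    return band.reshape(h // p, p, w // p, p).transpose(0, 2, 1, 3).reshape(-1, p * p)


def from_patches(patches, shape, p):
    """Inverse of to_patches."""
    h, w = shape
    return patches.reshape(h // p, w // p, p, p).transpose(0, 2, 1, 3).reshape(h, w)


def quantize(vectors, codebook):
    """Nearest-neighbour lookup: return code indices and the quantized vectors."""
    dists = ((vectors[:, None, :] - codebook[None, :, :]) ** 2).sum(axis=-1)
    idx = dists.argmin(axis=1)
    return idx, codebook[idx]


def pyramid_vq(frame, n_levels=3, patch=2, codebook_size=16, seed=0):
    """Quantize each frequency band with its own codebook and sum the results."""
    rng = np.random.default_rng(seed)
    recon = np.zeros(frame.shape, dtype=float)
    codes = []
    for band in split_bands(frame, n_levels):
        vecs = to_patches(band, patch)
        # Assumption: codebook = random subset of patches (a real model would learn it).
        codebook = vecs[rng.choice(len(vecs), size=codebook_size, replace=False)]
        idx, quantized = quantize(vecs, codebook)
        codes.append(idx)
        recon += from_patches(quantized, frame.shape, patch)
    return codes, recon


frame = np.random.default_rng(0).standard_normal((16, 16))
codes, recon = pyramid_vq(frame)
print([c.shape for c in codes], float(np.mean((frame - recon) ** 2)))
```

Each band here is quantized independently; a residual scheme that quantizes coarse bands first and feeds the error to finer levels would be another reasonable reading of "hierarchical", and the sketch does not claim to match either the paper's choice or its treatment of the temporal dimension.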