Spaces:
Sleeping
Sleeping
primepake
commited on
Commit
·
201da39
1
Parent(s):
f973bf5
update model
Browse files
README.md
CHANGED
|
@@ -48,7 +48,8 @@ Maps discrete tokens to a continuous latent space using a Variational Autoencode
|
|
| 48 |
Before training the main model:
|
| 49 |
|
| 50 |
1. Extract discrete tokens using the trained FSQ [S3Tokenizer](https://github.com/xingchensong/S3Tokenizer)
|
| 51 |
-
2. Generate continuous latent representations using the trained DAC-VAE - the pretrained I provided [DAC-VAE](https://github.com/primepake/learnable-speech/releases/tag/dac-vae)
|
|
|
|
| 52 |
|
| 53 |
### 3. Two-Stage Training
|
| 54 |
|
|
|
|
| 48 |
Before training the main model:
|
| 49 |
|
| 50 |
1. Extract discrete tokens using the trained FSQ [S3Tokenizer](https://github.com/xingchensong/S3Tokenizer)
|
| 51 |
+
2. Generate continuous latent representations using the trained DAC-VAE - the pretrained I provided [DAC-VAE](https://github.com/primepake/learnable-speech/releases/tag/dac-vae)
|
| 52 |
+
- Notes: This model is trained with scale one fsq token will have 3 fractor of frame rate in dac-vae latent, will update 2 fractor soon
|
| 53 |
|
| 54 |
### 3. Two-Stage Training
|
| 55 |
|