Spaces:

mnhatdaous
/

learnable-speech

Sleeping

primepake commited on Jul 7

Commit

24941fa

1 Parent(s): 3595b78

add fsq training

Files changed (1) hide show

README.md CHANGED Viewed

@@ -11,6 +11,7 @@ This repository provides an implementation of the MiniMax-Speech model, featurin
 ## Key Features
 - [ ] **24kHz Audio Support**: High-quality audio generation at 24kHz sampling rate
 - [ ] **Two-Stage Architecture**: Optimized training pipeline with discrete and continuous representations
 - [ ] **Modular Design**: Separate components for audio codec and variational autoencoder
 - [ ] **CosyVoice2 Decoder**: Leverages proven components from the CosyVoice2's Decoder framework

 ## Key Features
 - [ ] **24kHz Audio Support**: High-quality audio generation at 24kHz sampling rate
+- [ ] **FSQ tokenizer training**: Training FSQ from scratch
 - [ ] **Two-Stage Architecture**: Optimized training pipeline with discrete and continuous representations
 - [ ] **Modular Design**: Separate components for audio codec and variational autoencoder
 - [ ] **CosyVoice2 Decoder**: Leverages proven components from the CosyVoice2's Decoder framework