initial commit
This view is limited to 50 files because it contains too many changes.
- .gitattributes +2 -0
- README.md +102 -0
- assets/concept.png +3 -0
- wavlm24_wavefit5/config.yaml +3 -0
- wavlm24_wavefit5/discriminator.pth +3 -0
- wavlm24_wavefit5/generator.pth +3 -0
- wavlm24_wavefit5/opt_d.pth +3 -0
- wavlm24_wavefit5/opt_g.pth +3 -0
- wavlm24_wavefit5/sche_d.pth +3 -0
- wavlm24_wavefit5/sche_g.pth +3 -0
- wavlm24_wavetrainerfit5/config.yaml +3 -0
- wavlm24_wavetrainerfit5/discriminator.pth +3 -0
- wavlm24_wavetrainerfit5/generator.pth +3 -0
- wavlm24_wavetrainerfit5/opt_d.pth +3 -0
- wavlm24_wavetrainerfit5/opt_g.pth +3 -0
- wavlm24_wavetrainerfit5/sche_d.pth +3 -0
- wavlm24_wavetrainerfit5/sche_g.pth +3 -0
- wavlm2_wavefit5/config.yaml +3 -0
- wavlm2_wavefit5/discriminator.pth +3 -0
- wavlm2_wavefit5/generator.pth +3 -0
- wavlm2_wavefit5/opt_d.pth +3 -0
- wavlm2_wavefit5/opt_g.pth +3 -0
- wavlm2_wavefit5/sche_d.pth +3 -0
- wavlm2_wavefit5/sche_g.pth +3 -0
- wavlm2_wavetrainerfit5/config.yaml +3 -0
- wavlm2_wavetrainerfit5/discriminator.pth +3 -0
- wavlm2_wavetrainerfit5/generator.pth +3 -0
- wavlm2_wavetrainerfit5/opt_d.pth +3 -0
- wavlm2_wavetrainerfit5/opt_g.pth +3 -0
- wavlm2_wavetrainerfit5/sche_d.pth +3 -0
- wavlm2_wavetrainerfit5/sche_g.pth +3 -0
- wavlm8_wavefit5/config.yaml +3 -0
- wavlm8_wavefit5/discriminator.pth +3 -0
- wavlm8_wavefit5/generator.pth +3 -0
- wavlm8_wavefit5/opt_d.pth +3 -0
- wavlm8_wavefit5/opt_g.pth +3 -0
- wavlm8_wavefit5/sche_d.pth +3 -0
- wavlm8_wavefit5/sche_g.pth +3 -0
- wavlm8_wavetrainerfit5/config.yaml +3 -0
- wavlm8_wavetrainerfit5/discriminator.pth +3 -0
- wavlm8_wavetrainerfit5/generator.pth +3 -0
- wavlm8_wavetrainerfit5/opt_d.pth +3 -0
- wavlm8_wavetrainerfit5/opt_g.pth +3 -0
- wavlm8_wavetrainerfit5/sche_d.pth +3 -0
- wavlm8_wavetrainerfit5/sche_g.pth +3 -0
- whisper8_wavefit5/config.yaml +3 -0
- whisper8_wavefit5/discriminator.pth +3 -0
- whisper8_wavefit5/generator.pth +3 -0
- whisper8_wavefit5/opt_d.pth +3 -0
- whisper8_wavefit5/opt_g.pth +3 -0
.gitattributes
CHANGED
|
@@ -33,3 +33,5 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
|
| 33 |
*.zip filter=lfs diff=lfs merge=lfs -text
|
| 34 |
*.zst filter=lfs diff=lfs merge=lfs -text
|
| 35 |
*tfevents* filter=lfs diff=lfs merge=lfs -text
|
| 36 |
+
*.yaml filter=lfs diff=lfs merge=lfs -text
|
| 37 |
+
*.png filter=lfs diff=lfs merge=lfs -text
|
README.md
ADDED
|
@@ -0,0 +1,102 @@
|
| 1 |
+
---
|
| 2 |
+
language:
|
| 3 |
+
- en
|
| 4 |
+
tags:
|
| 5 |
+
- speech
|
| 6 |
+
license:
|
| 7 |
+
- cc-by-sa-3.0
|
| 8 |
+
- cc-by-4.0
|
| 9 |
+
---
|
| 10 |
+
|
| 11 |
+
# Wave-Trainer-Fit | Neural vocoder from SSL features
|
| 12 |
+
|
| 13 |
+
[[Code of Wave-Trainer-Fit](https://github.com/line/WaveTrainerFit)][[audio samples](https://i17oonaka-h.github.io/projects/research_topics/wave_trainer_fit/)]
|
| 14 |
+
|
| 15 |
+
>**Abstract:**<br>
|
| 16 |
+
We propose WaveTrainerFit, a neural vocoder that performs high-quality waveform generation from data-driven features such as SSL features. WaveTrainerFit builds upon the WaveFit vocoder, which integrates a diffusion model and a generative adversarial network. The proposed method adds two key improvements: 1. By introducing trainable priors, the inference process starts from noise close to the target speech instead of Gaussian noise. 2. Reference-aware gain adjustment is performed by constraining the trainable prior to match the speech energy. These improvements are expected to reduce the complexity of waveform modeling from data-driven features, enabling high-quality waveform generation with fewer inference steps. In our experiments, WaveTrainerFit generated highly natural waveforms with improved speaker similarity from data-driven features while requiring fewer iterations than WaveFit. The proposed method also proved robust to the depth at which SSL features are extracted.
|
| 17 |
+
|
| 18 |
+

|
| 19 |
+
|
| 20 |
+
This repository provides pre-trained models together with their optimizer and scheduler states.
|
| 21 |
+
The models were pre-trained on [LibriTTS-R](https://www.openslr.org/141/) (train-clean-360).
|
| 22 |
+
The models take SSL features extracted from 16 kHz audio as input and reconstruct 24 kHz audio.
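As a rough sanity check on the two sampling rates, the sketch below estimates how many SSL frames a 16 kHz clip yields and how many 24 kHz samples would correspond to them. The kernel sizes and strides are those of the wav2vec 2.0-style convolutional front end, assumed here to match WavLM-large. The fixed 480 output samples per frame (24000 / 50) is our assumption and is not stated in this repository. Both function names are hypothetical.

```python
# Kernel sizes and strides of the wav2vec 2.0-style convolutional front end.
# Assumption: WavLM-large uses the same values; verify against the model config.
CONV_KERNELS = (10, 3, 3, 3, 3, 2, 2)
CONV_STRIDES = (5, 2, 2, 2, 2, 2, 2)


def num_ssl_frames(num_samples: int) -> int:
    """Return the number of feature frames for a 16 kHz clip of num_samples."""
    length = num_samples
    for kernel, stride in zip(CONV_KERNELS, CONV_STRIDES):
        if length < kernel:
            return 0  # clip too short to fill one receptive field
        length = (length - kernel) // stride + 1
    return length


def expected_output_samples(num_frames: int,
                            output_rate: int = 24000,
                            frame_rate: int = 50) -> int:
    """Samples emitted if every frame is upsampled equally (an assumption)."""
    return num_frames * (output_rate // frame_rate)


frames = num_ssl_frames(16000)              # one second of 16 kHz input
samples = expected_output_samples(frames)   # matching number of 24 kHz samples
print(frames, samples)
```

Under these assumptions the output is slightly shorter than the input duration, because the front end drops the samples that do not fill a complete receptive field.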
|
| 23 |
+
|
| 24 |
+
## Pre-trained model list
|
| 25 |
+
> [!IMPORTANT]
|
| 26 |
+
⚠️ **License Notice**: The model weights in this repository are not all under the same license. The `xlsr*` and `whisper*` models are licensed under different terms from the `wavlm*` models. Please refer to the [License](#license) section for details.
|
| 27 |
+
|
| 28 |
+
The list of available models is as follows:
|
| 29 |
+
|
| 30 |
+
| Model name | Conditional features | Layer number | Number of iterations |
|
| 31 |
+
|:------| :---------: | :---: | :---: |
|
| 32 |
+
| wavlm2_wavetrainerfit5 | [WavLM-large](https://huggingface.co/microsoft/wavlm-large) | 2 | 5 |
|
| 33 |
+
| wavlm2_wavefit5 | [WavLM-large](https://huggingface.co/microsoft/wavlm-large) | 2 | 5 |
|
| 34 |
+
| wavlm8_wavetrainerfit5 | [WavLM-large](https://huggingface.co/microsoft/wavlm-large) | 8| 5 |
|
| 35 |
+
| wavlm8_wavefit5 | [WavLM-large](https://huggingface.co/microsoft/wavlm-large) | 8| 5 |
|
| 36 |
+
| wavlm24_wavetrainerfit5 | [WavLM-large](https://huggingface.co/microsoft/wavlm-large) | 24| 5 |
|
| 37 |
+
| wavlm24_wavefit5 | [WavLM-large](https://huggingface.co/microsoft/wavlm-large) | 24| 5 |
|
| 38 |
+
| xlsr8_wavetrainerfit5 | [XLS-R-300m](https://huggingface.co/facebook/wav2vec2-xls-r-300m) | 8| 5 |
|
| 39 |
+
| xlsr8_wavefit5 | [XLS-R-300m](https://huggingface.co/facebook/wav2vec2-xls-r-300m) | 8| 5 |
|
| 40 |
+
| whisper8_wavetrainerfit5 | ※ [Whisper-medium](https://huggingface.co/openai/whisper-medium) | 8| 5 |
|
| 41 |
+
| whisper8_wavefit5 | ※ [Whisper-medium](https://huggingface.co/openai/whisper-medium) | 8| 5 |
|
| 42 |
+
|
| 43 |
+
※ Our verification found that amplitude decay occurs in Whisper features after about 2.0 seconds.
|
| 44 |
+
During evaluation, inputs were therefore processed as follows: `split into 2.0-second segments → extract features with the Whisper encoder → recombine → resynthesize`.
|
| 45 |
+
If you use this model in your application, the upstream feature extraction must also follow this flow.
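The segment-wise flow described above can be sketched as follows. This is a minimal illustration, not the authors' implementation: `segment_bounds`, `extract_chunked`, and the `encode_segment` callback are hypothetical names. The handling of a final segment shorter than 2.0 seconds and the exact recombination step are assumptions, since this README does not specify them. In practice `encode_segment` would wrap the Whisper feature extractor and encoder and return only the frames that correspond to the unpadded segment.

```python
from typing import Callable, List, Sequence, Tuple


def segment_bounds(num_samples: int,
                   sample_rate: int = 16000,
                   segment_seconds: float = 2.0) -> List[Tuple[int, int]]:
    """Return (start, end) sample indices of consecutive fixed-length segments."""
    seg_len = int(round(sample_rate * segment_seconds))
    if seg_len <= 0:
        raise ValueError("segment length must be positive")
    # Assumption: the last segment is simply allowed to be shorter.
    return [(start, min(start + seg_len, num_samples))
            for start in range(0, num_samples, seg_len)]


def extract_chunked(samples: Sequence[float],
                    encode_segment: Callable[[Sequence[float]], list],
                    sample_rate: int = 16000,
                    segment_seconds: float = 2.0) -> list:
    """Encode each segment independently and concatenate the frames in order."""
    frames: list = []
    for start, end in segment_bounds(len(samples), sample_rate, segment_seconds):
        # encode_segment is a hypothetical stand-in for the Whisper encoder.
        frames.extend(encode_segment(samples[start:end]))
    return frames


# Toy usage: a 5-second clip and a dummy encoder emitting one "frame" per segment.
clip = [0.0] * (5 * 16000)
features = extract_chunked(clip, lambda seg: [len(seg)])
print(segment_bounds(len(clip)), features)
```

Passing the encoder as a callback keeps the segmentation logic independent of any particular model, so the same helper can be exercised with a dummy encoder before a real one is plugged in.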
|
| 46 |
+
|
| 47 |
+
## Usage
|
| 48 |
+
Please refer to [our GitHub repository](https://github.com/line/WaveTrainerFit) for instructions on how to use the models provided here.
|
| 49 |
+
The following is reproduced from the GitHub repository:
|
| 50 |
+
```python
|
| 51 |
+
import torchaudio
|
| 52 |
+
import torch
|
| 53 |
+
from wavetrainerfit import load_pretrained_vocoder
|
| 54 |
+
from transformers import WavLMModel, AutoFeatureExtractor
|
| 55 |
+
|
| 56 |
+
ssl_preprocessor = AutoFeatureExtractor.from_pretrained('microsoft/wavlm-large')
|
| 57 |
+
ssl_model: WavLMModel = WavLMModel.from_pretrained('microsoft/wavlm-large')
|
| 58 |
+
|
| 59 |
+
layer = 2
|
| 60 |
+
ssl_vocoder, cfg = load_pretrained_vocoder(f'wavlm{layer}_wavetrainerfit5')
|
| 61 |
+
waveform, sr = torchaudio.load('./assets/ljspeech-samples/LJ037-0171.wav')
|
| 62 |
+
if sr != 16000:
|
| 63 |
+
    waveform = torchaudio.transforms.Resample(
|
| 64 |
+
        orig_freq=sr,
|
| 65 |
+
        new_freq=16000
|
| 66 |
+
    )(waveform)
|
| 67 |
+
inputs = ssl_preprocessor(
|
| 68 |
+
    waveform[0].numpy(),
|
| 69 |
+
    sampling_rate=16000,
|
| 70 |
+
    return_tensors="pt"
|
| 71 |
+
)
|
| 72 |
+
|
| 73 |
+
with torch.no_grad():
|
| 74 |
+
    inputs = ssl_model(**inputs, output_hidden_states=True)
|
| 75 |
+
    inputs = inputs.hidden_states[layer] # (Batch, Timeframe, Featuredim)
|
| 76 |
+
    generated_waveform = ssl_vocoder.pred(
|
| 77 |
+
        conditional_feature=inputs, # (Batch, Timeframe, Featuredim)
|
| 78 |
+
        T_=5 # number of iterations
|
| 79 |
+
    )
|
| 80 |
+
|
| 81 |
+
torchaudio.save(
|
| 82 |
+
    './assets/ljspeech-samples/LJ037-0171-reconstructed.wav',
|
| 83 |
+
    generated_waveform[-1][:, 0].cpu(), 24000
|
| 84 |
+
)
|
| 85 |
+
```
|
| 86 |
+
|
| 87 |
+
## License
|
| 88 |
+
|
| 89 |
+
### Model Weights
|
| 90 |
+
|
| 91 |
+
**`xlsr*` and `whisper*` models**: Licensed under [CC BY 4.0](https://creativecommons.org/licenses/by/4.0/)
|
| 92 |
+
- Based on XLS-R by facebook (Apache 2.0) - https://huggingface.co/facebook/wav2vec2-xls-r-300m
|
| 93 |
+
- Based on Whisper by OpenAI (Apache 2.0) - https://huggingface.co/openai/whisper-medium
|
| 94 |
+
|
| 95 |
+
**`wavlm*` models**: Licensed under [CC BY-SA 3.0](https://creativecommons.org/licenses/by-sa/3.0/)
|
| 96 |
+
- Based on WavLM by Microsoft Corporation ([CC BY-SA 3.0](https://github.com/microsoft/UniSpeech/blob/main/LICENSE)) - https://huggingface.co/microsoft/wavlm-large
|
| 97 |
+
- ⚠️ Derivative works must also use CC BY-SA 3.0
|
| 98 |
+
|
| 99 |
+
**Training data**: LibriTTS-R (CC BY 4.0) - https://www.openslr.org/141/
|
| 100 |
+
|
| 101 |
+
When using these models, you must comply with both our license and the original upstream model licenses.
|
| 102 |
+
|
assets/concept.png
ADDED
|
Git LFS Details
|
wavlm24_wavefit5/config.yaml
ADDED
|
@@ -0,0 +1,3 @@
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:5d586588d05a9b642ac80abbb32f01c3191425dc448bcda40bbea6f517e0a0ff
|
| 3 |
+
size 3677
|
wavlm24_wavefit5/discriminator.pth
ADDED
|
@@ -0,0 +1,3 @@
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:6aee01df939ed25c322d28762afc88674afe2d33b4a48f541446c54de758542b
|
| 3 |
+
size 67725739
|
wavlm24_wavefit5/generator.pth
ADDED
|
@@ -0,0 +1,3 @@
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:0e1b80a25af03d99141090b152db8810239d07573d86c1ef1633655a9e985b47
|
| 3 |
+
size 70381754
|
wavlm24_wavefit5/opt_d.pth
ADDED
|
@@ -0,0 +1,3 @@
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:7a5c03b48398fc3105ccd5bb3ef77f963c37244caaf9912ab31d5523b39cbc9a
|
| 3 |
+
size 135436790
|
wavlm24_wavefit5/opt_g.pth
ADDED
|
@@ -0,0 +1,3 @@
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:d09a1159985eda4d9c2d1823970b3ee4d98c5981ca27c51809bcab5960e4268e
|
| 3 |
+
size 140717450
|
wavlm24_wavefit5/sche_d.pth
ADDED
|
@@ -0,0 +1,3 @@
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:a5ccff4d21f36dc59881448522898302255373b0fd88643993f30c892d74c3c3
|
| 3 |
+
size 1052
|
wavlm24_wavefit5/sche_g.pth
ADDED
|
@@ -0,0 +1,3 @@
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:ac5e39de608aa344d36de8c5b797e21a0469120a8430cd24e6dc3c0b9bd28c37
|
| 3 |
+
size 1052
|
wavlm24_wavetrainerfit5/config.yaml
ADDED
|
@@ -0,0 +1,3 @@
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:a701b56b2f165cac78fbea42ba31c232c770fffafa80b306ac665a2d17df54e7
|
| 3 |
+
size 3912
|
wavlm24_wavetrainerfit5/discriminator.pth
ADDED
|
@@ -0,0 +1,3 @@
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:d347f5a97a9fd57dcafdcef74ff39b26f49128c0718ecf4c7c75543a48f161d1
|
| 3 |
+
size 67725739
|
wavlm24_wavetrainerfit5/generator.pth
ADDED
|
@@ -0,0 +1,3 @@
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:468c8db191e4a20882255daf2d2fa2b1362f62d823968d546f9a2d5e9d962a84
|
| 3 |
+
size 91213626
|
wavlm24_wavetrainerfit5/opt_d.pth
ADDED
|
@@ -0,0 +1,3 @@
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:707cb0713154b1476a0accccbd8252912ba8ee46d10c133382aa7439f5cad606
|
| 3 |
+
size 135436790
|
wavlm24_wavetrainerfit5/opt_g.pth
ADDED
|
@@ -0,0 +1,3 @@
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:7fc004ee2bcc1a461804565790849da25f044df55bd7da857e4dfbc963446b57
|
| 3 |
+
size 182384602
|
wavlm24_wavetrainerfit5/sche_d.pth
ADDED
|
@@ -0,0 +1,3 @@
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:a5ccff4d21f36dc59881448522898302255373b0fd88643993f30c892d74c3c3
|
| 3 |
+
size 1052
|
wavlm24_wavetrainerfit5/sche_g.pth
ADDED
|
@@ -0,0 +1,3 @@
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:ac5e39de608aa344d36de8c5b797e21a0469120a8430cd24e6dc3c0b9bd28c37
|
| 3 |
+
size 1052
|
wavlm2_wavefit5/config.yaml
ADDED
|
@@ -0,0 +1,3 @@
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:6e85396f18f084867faa6fdb1354e4e3bc35a7c48add5e95025bb6d411c38dcd
|
| 3 |
+
size 3673
|
wavlm2_wavefit5/discriminator.pth
ADDED
|
@@ -0,0 +1,3 @@
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:6e6aa5baf6c74fc46750287a29a23fa4546ff2730e7864ab2055d79abe370818
|
| 3 |
+
size 67725739
|
wavlm2_wavefit5/generator.pth
ADDED
|
@@ -0,0 +1,3 @@
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:52de0f0156bd59aeac61ae07f5ddb016f82064b22fe047c607ebbd6426113e3d
|
| 3 |
+
size 70381754
|
wavlm2_wavefit5/opt_d.pth
ADDED
|
@@ -0,0 +1,3 @@
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:37abef47b4ed918754329e0e846712de923f322ae955a972b4d5563fce612428
|
| 3 |
+
size 135436790
|
wavlm2_wavefit5/opt_g.pth
ADDED
|
@@ -0,0 +1,3 @@
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:05c3f532b8bf92fd7c6fadb246ac59bd5d8fc20d353b4abcd4ea9824fe1946d1
|
| 3 |
+
size 140717450
|
wavlm2_wavefit5/sche_d.pth
ADDED
|
@@ -0,0 +1,3 @@
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:a5ccff4d21f36dc59881448522898302255373b0fd88643993f30c892d74c3c3
|
| 3 |
+
size 1052
|
wavlm2_wavefit5/sche_g.pth
ADDED
|
@@ -0,0 +1,3 @@
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:ac5e39de608aa344d36de8c5b797e21a0469120a8430cd24e6dc3c0b9bd28c37
|
| 3 |
+
size 1052
|
wavlm2_wavetrainerfit5/config.yaml
ADDED
|
@@ -0,0 +1,3 @@
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:acf522aa9c5767f128ff303572d342185a4202068ef7776b3a32e0c5b40a7181
|
| 3 |
+
size 3908
|
wavlm2_wavetrainerfit5/discriminator.pth
ADDED
|
@@ -0,0 +1,3 @@
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:3a36ffaf39439d1658a480c832992679fd6d4689089bf2a3b220919b179d4c8d
|
| 3 |
+
size 67725739
|
wavlm2_wavetrainerfit5/generator.pth
ADDED
|
@@ -0,0 +1,3 @@
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:877098b6e68cbd5f3aebab53eace69ac9da2e19c3427715e59c41db9307fac5e
|
| 3 |
+
size 91213626
|
wavlm2_wavetrainerfit5/opt_d.pth
ADDED
|
@@ -0,0 +1,3 @@
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:bc2b82acce5d21d8cc2080f312e63aa4892db1cd5d3f82ea5c49f9d57ac6d99f
|
| 3 |
+
size 135436790
|
wavlm2_wavetrainerfit5/opt_g.pth
ADDED
|
@@ -0,0 +1,3 @@
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:abdd25c65f8e667081218e455ed112508afd71fae9ce43111104fab1f1fc7593
|
| 3 |
+
size 182384602
|
wavlm2_wavetrainerfit5/sche_d.pth
ADDED
|
@@ -0,0 +1,3 @@
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:2fdb702a665e174fdb6a3143aa3339da61f0029022afa7569c641098aa643475
|
| 3 |
+
size 1052
|
wavlm2_wavetrainerfit5/sche_g.pth
ADDED
|
@@ -0,0 +1,3 @@
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:fac4aba57e808c8fcf7e561404c578a6c2a5134d9ce4b6ec40e0ceb5e3436e73
|
| 3 |
+
size 1052
|
wavlm8_wavefit5/config.yaml
ADDED
|
@@ -0,0 +1,3 @@
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:82e4e1d6a28bc0de9adcaa0e8d77e2732482b78dd4c1783ddba5ea7f4c990fe1
|
| 3 |
+
size 3673
|
wavlm8_wavefit5/discriminator.pth
ADDED
|
@@ -0,0 +1,3 @@
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:83eb614cd4c210680945a5770159f0993d19baec822214366f4ab7174e9384f4
|
| 3 |
+
size 67725739
|
wavlm8_wavefit5/generator.pth
ADDED
|
@@ -0,0 +1,3 @@
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:2c9d1c589c7d18956f81a66dbf00836e417960a6e542f424381d305ebbe1811f
|
| 3 |
+
size 70381754
|
wavlm8_wavefit5/opt_d.pth
ADDED
|
@@ -0,0 +1,3 @@
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:65b8c76e303322ab79e1f63ff225b5d6e24eeecc0f5bbb5ec3fa785a42b3096a
|
| 3 |
+
size 135436790
|
wavlm8_wavefit5/opt_g.pth
ADDED
|
@@ -0,0 +1,3 @@
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:b02919a98e60f811e09df28b96555443e79f502f9aa60f07761233b9e0ea9b59
|
| 3 |
+
size 140717450
|
wavlm8_wavefit5/sche_d.pth
ADDED
|
@@ -0,0 +1,3 @@
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:a5ccff4d21f36dc59881448522898302255373b0fd88643993f30c892d74c3c3
|
| 3 |
+
size 1052
|
wavlm8_wavefit5/sche_g.pth
ADDED
|
@@ -0,0 +1,3 @@
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:ac5e39de608aa344d36de8c5b797e21a0469120a8430cd24e6dc3c0b9bd28c37
|
| 3 |
+
size 1052
|
wavlm8_wavetrainerfit5/config.yaml
ADDED
|
@@ -0,0 +1,3 @@
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:5eeca4b9b0ca8714fda83c131b9afa4c23645fedda1d2f1c39195a1b8aa7b8f0
|
| 3 |
+
size 3908
|
wavlm8_wavetrainerfit5/discriminator.pth
ADDED
|
@@ -0,0 +1,3 @@
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:f7246427cdaa1eb4195f1e6654dcb429c39ac18be704170efab70c04e443527a
|
| 3 |
+
size 67725739
|
wavlm8_wavetrainerfit5/generator.pth
ADDED
|
@@ -0,0 +1,3 @@
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:19ab744075817741909515bb7158baf61aa2086ff3c031b74d5c888109cb8a04
|
| 3 |
+
size 91213626
|
wavlm8_wavetrainerfit5/opt_d.pth
ADDED
|
@@ -0,0 +1,3 @@
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:2fee63b67aac8f4def14d7fd4c2ebe151eafb49093020d1f70bb245fc8b5f8ba
|
| 3 |
+
size 135436790
|
wavlm8_wavetrainerfit5/opt_g.pth
ADDED
|
@@ -0,0 +1,3 @@
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:5bee39ba66414fb4566eb0cc9ce84d1fd71af048d2185e04b15a81a58088d3db
|
| 3 |
+
size 182384602
|
wavlm8_wavetrainerfit5/sche_d.pth
ADDED
|
@@ -0,0 +1,3 @@
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:a5ccff4d21f36dc59881448522898302255373b0fd88643993f30c892d74c3c3
|
| 3 |
+
size 1052
|
wavlm8_wavetrainerfit5/sche_g.pth
ADDED
|
@@ -0,0 +1,3 @@
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:ac5e39de608aa344d36de8c5b797e21a0469120a8430cd24e6dc3c0b9bd28c37
|
| 3 |
+
size 1052
|
whisper8_wavefit5/config.yaml
ADDED
|
@@ -0,0 +1,3 @@
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:dd1c5e6bf31cb139aca11dc672dd0f1dd24e84870059308b7a551f2639c0fd26
|
| 3 |
+
size 3681
|
whisper8_wavefit5/discriminator.pth
ADDED
|
@@ -0,0 +1,3 @@
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:d6e8fafbf5a9d606065279ba4c919018c652f4d4ace7564e793b1ab24a0c847a
|
| 3 |
+
size 67725739
|
whisper8_wavefit5/generator.pth
ADDED
|
@@ -0,0 +1,3 @@
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:862254c7c6fee39023ee20950d7e843d154eadb258893680b3824d48a9262ff0
|
| 3 |
+
size 70381754
|
whisper8_wavefit5/opt_d.pth
ADDED
|
@@ -0,0 +1,3 @@
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:9734c216ba0bc72314d3910b764b4284dd9e51fcabe3d81885cc8b8ac35b6d90
|
| 3 |
+
size 135436790
|
whisper8_wavefit5/opt_g.pth
ADDED
|
@@ -0,0 +1,3 @@
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:5f18eb7c1927042d49bfa2c2afb46784bb92d5bb21a0163ad5821ca02d26ab8c
|
| 3 |
+
size 140717130
|