Commit
·
8471a16
1
Parent(s):
38f0a65
add models
Browse files
README.md
CHANGED
|
@@ -2,3 +2,47 @@
|
|
| 2 |
license: cc-by-nc-4.0
|
| 3 |
---
|
| 4 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 2 |
license: cc-by-nc-4.0
|
| 3 |
---
|
| 4 |
|
| 5 |
+
### guitar_iil_b2048_r48000_z16.ts
|
| 6 |
+
|
| 7 |
+
Dataset: [IILGuitarTimbre](https://github.com/Intelligent-Instruments-Lab/IILGuitarTimbre).
|
| 8 |
+
|
| 9 |
+
Model: modified RAVE v1, 48kHz, block size 2048, 16 latent dimensions.
|
| 10 |
+
|
| 11 |
+
### organ_archive_b2048_r48000_z16.ts
|
| 12 |
+
|
| 13 |
+
Dataset: public domain organ music from archive.org. Small amounts of voice and other instruments were included, and vinyl record noises are prominent.
|
| 14 |
+
|
| 15 |
+
Model: modified RAVE v1, 48kHz, block size 2048, 16 latent dimensions.
|
| 16 |
+
|
| 17 |
+
### organ_bach_b2048_sr48000_z16.ts
|
| 18 |
+
|
| 19 |
+
Dataset: various recordings of J. S. Bach music for church organ.
|
| 20 |
+
|
| 21 |
+
Model: modified RAVE v1, 48kHz, block size 2048, 16 latent dimensions.
|
| 22 |
+
|
| 23 |
+
### voice_vocalset_b2048_r48000_z16.ts
|
| 24 |
+
|
| 25 |
+
Dataset: [VocalSet](https://zenodo.org/record/1193957) singing voice dataset.
|
| 26 |
+
|
| 27 |
+
Model: modified RAVE v1, 48kHz, block size 2048, 16 latent dimensions.
|
| 28 |
+
|
| 29 |
+
### voice_hifitts_b2048_r48000_z16.ts
|
| 30 |
+
|
| 31 |
+
Dataset: [Hi-Fi TTS](http://arxiv.org/abs/2104.01497) audiobooks dataset.
|
| 32 |
+
|
| 33 |
+
Model: modified RAVE v1, 48kHz, block size 2048, 16 latent dimensions.
|
| 34 |
+
|
| 35 |
+
### voice_jvs_b2048_r44100_z16.ts
|
| 36 |
+
|
| 37 |
+
Dataset: [Hi-Fi TTS](http://arxiv.org/abs/2104.01497) speaker 9017 (John Van Stan).
|
| 38 |
+
|
| 39 |
+
Model: RAVE v3, 44.1kHz, block size 2048, 16 latent dimensions.
|
| 40 |
+
|
| 41 |
+
### voice_vctk_b2048_r44100_z16.ts
|
| 42 |
+
|
| 43 |
+
Dataset: [CSTR VCTK Corpus](https://datashare.ed.ac.uk/handle/10283/3443) multispeaker read speech dataset.
|
| 44 |
+
|
| 45 |
+
Model: RAVE v3, 44.1kHz, block size 2048, 22 latent dimensions.
|
| 46 |
+
|
| 47 |
+
|
| 48 |
+
|
guitar_iil_b2048_r48000_z16.ts
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:02458214e23890d6818504319a5b9903eabfe87a524491f6524f453e7f3dbcf0
|
| 3 |
+
size 163881670
|
organ_archive_b2048_r48000_z16.ts
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:7fb80ff896c114e1ed436dfa4059e23694c8b0e36f2b16532b637f9b8854f96d
|
| 3 |
+
size 163885039
|
organ_bach_b2048_sr48000_z16.ts
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:f7c06309e0388e666993226c06ed1438b56adc23b2a5a3b8f9155ed26990423c
|
| 3 |
+
size 163879431
|
voice_hifitts_b2048_r48000_z16.ts
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:67e888716655c5670d5d9e15d0bc43b5851ddd7a3004512a0c400a2eeb62522a
|
| 3 |
+
size 163881009
|
voice_jvs_b2048_r44100_z16.ts
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:5d41684d151c0a98a51815479d866c1b4f8d8cbe2cdb62652d27f6ff2286ed77
|
| 3 |
+
size 150059552
|
voice_vctk_b2048_r44100_z22.ts
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:9e5578ea2c98856eff6b511089cc1eaba69eaf85527ad343604a6420fe3a751f
|
| 3 |
+
size 150058264
|
voice_vocalset_b2048_r48000_z16.ts
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:ba1b5392c4645c8040aa618e43b8269d840b9752536caac37c91c698334fa9a6
|
| 3 |
+
size 163882118
|