HebTTS
- .gitattributes +2 -0
- README.md +18 -0
- additional_info.pdf +3 -0
- checkpoint-150000.pt +3 -0
- dataset/2407.07566v1 HEBDB. A Weakly Supervised Dataset for Hebrew Speech Processing.pdf +3 -0
- source.txt +2 -0
.gitattributes
CHANGED
@@ -33,3 +33,5 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
 *.zip filter=lfs diff=lfs merge=lfs -text
 *.zst filter=lfs diff=lfs merge=lfs -text
 *tfevents* filter=lfs diff=lfs merge=lfs -text
+additional_info.pdf filter=lfs diff=lfs merge=lfs -text
+dataset/2407.07566v1[[:space:]]HEBDB.[[:space:]]A[[:space:]]Weakly[[:space:]]Supervised[[:space:]]Dataset[[:space:]]for[[:space:]]Hebrew[[:space:]]Speech[[:space:]]Processing.pdf filter=lfs diff=lfs merge=lfs -text
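The second added line writes each space in the PDF's filename as `[[:space:]]`, because `.gitattributes` splits each line on whitespace and a literal space would end the pattern early. A minimal Python sketch of that escaping follows. The helper name `gitattributes_lfs_line` is hypothetical, and it handles spaces only, not every character Git might treat specially.

```python
def gitattributes_lfs_line(path: str) -> str:
    """Build an LFS tracking line for `path`, escaping spaces.

    Hypothetical helper: .gitattributes separates the pattern from its
    attributes with whitespace, so each literal space in the path is
    written as the character class [[:space:]].
    """
    pattern = path.replace(" ", "[[:space:]]")
    return f"{pattern} filter=lfs diff=lfs merge=lfs -text"


line = gitattributes_lfs_line(
    "dataset/2407.07566v1 HEBDB. A Weakly Supervised Dataset "
    "for Hebrew Speech Processing.pdf"
)
print(line)
```

Run on the dataset paper's path, this reproduces the pattern added in the hunk above.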
README.md
ADDED
@@ -0,0 +1,18 @@
+---
+datasets:
+- SLPRL-HUJI/HebDB
+language:
+- he
+metrics:
+- wer
+- cer
+pipeline_tag: text-to-speech
+---
+
+
+# Details
+
+This model is an implementation of the vall-e architecture, with the AlephBert text tokenizer.
+This model was trained as a final project in the "DSP & audio processing using Deep Learning" class at Tel-Aviv University, Israel.
+
+Implementation details and references can be found in the included 'paper' PDF.
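The README opens with a YAML front matter block between two `---` lines, holding the model card metadata (dataset, language, metrics, pipeline tag). As a rough illustration of how that block could be read, here is a small Python sketch that handles only the flat shape used here: `key: value` scalars and `key:` followed by `- item` lists. The function name `parse_front_matter` is hypothetical, and a full YAML parser would be the more robust choice for anything beyond this shape.

```python
def parse_front_matter(text: str) -> dict:
    """Parse a flat front matter block (scalars and simple lists only)."""
    lines = text.splitlines()
    if not lines or lines[0].strip() != "---":
        return {}  # no front matter present
    meta, key = {}, None
    for line in lines[1:]:
        stripped = line.strip()
        if stripped == "---":  # closing delimiter ends the block
            break
        if stripped.startswith("- ") and isinstance(meta.get(key), list):
            # List item belonging to the most recent list-valued key.
            meta[key].append(stripped[2:].strip())
        elif ":" in stripped:
            key, _, value = stripped.partition(":")
            key = key.strip()
            # A key with no inline value starts a list; otherwise keep the scalar.
            meta[key] = value.strip() if value.strip() else []
    return meta


README_HEAD = """\
---
datasets:
- SLPRL-HUJI/HebDB
language:
- he
metrics:
- wer
- cer
pipeline_tag: text-to-speech
---

# Details
"""

meta = parse_front_matter(README_HEAD)
print(meta["pipeline_tag"], meta["metrics"])
```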
additional_info.pdf
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:89b412730083c73688f540f0358bc1808ef038baf33f27df7e729e0ec7e8f9ec
+size 509426
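The three added lines are a Git LFS pointer, not the PDF itself: they record the spec version, the SHA-256 of the real file, and its size in bytes. The sketch below, in Python, shows one way to parse such a pointer and check a downloaded file against it. The function names `parse_lfs_pointer` and `matches_pointer` are hypothetical, and the code assumes the simple three-key layout seen in this commit.

```python
import hashlib
from pathlib import Path


def parse_lfs_pointer(text: str) -> dict:
    """Parse a Git LFS pointer file into version, hash algorithm, digest, and size."""
    fields = dict(line.split(" ", 1) for line in text.strip().splitlines())
    algo, digest = fields["oid"].split(":", 1)
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(path: Path, pointer: dict, chunk_size: int = 1 << 20) -> bool:
    """Return True if the file at `path` has the size and SHA-256 the pointer records."""
    if path.stat().st_size != pointer["size"]:
        return False  # cheap size check before hashing
    h = hashlib.sha256()
    with path.open("rb") as f:
        # Hash in chunks so multi-gigabyte checkpoints do not need to fit in memory.
        for chunk in iter(lambda: f.read(chunk_size), b""):
            h.update(chunk)
    return h.hexdigest() == pointer["digest"]


POINTER = """\
version https://git-lfs.github.com/spec/v1
oid sha256:89b412730083c73688f540f0358bc1808ef038baf33f27df7e729e0ec7e8f9ec
size 509426
"""

pointer = parse_lfs_pointer(POINTER)
print(pointer["algo"], pointer["size"])
```

The same check applies to the other pointers in this commit, including the checkpoint, whose pointer records a size of 2551217450 bytes.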
checkpoint-150000.pt
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:1fecc21e683dd103a7bfed57f986c14c84213f93aec815af73947e3dfec63e81
+size 2551217450
dataset/2407.07566v1 HEBDB. A Weakly Supervised Dataset for Hebrew Speech Processing.pdf
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:97febbefdb88fd63147d9eec9bc60138480a78a44eb0d40061bdce2e26a6cfa8
+size 278562
source.txt
ADDED
@@ -0,0 +1,2 @@
+https://huggingface.co/D4niel0s/HebTTS_implementation
+https://huggingface.co/datasets/SLPRL-HUJI/HebDB