File size: 422 Bytes
c645912
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
---
datasets:
- SLPRL-HUJI/HebDB
language:
- he
metrics:
- wer
- cer
pipeline_tag: text-to-speech
---


# Details

This model is an implementation of the vall-e architecture, with the AlephBert text tokenizer.
This model was trained as a final project in the "DSP & audio processing using Deep Learning" class at Tel-Aviv University, Israel.

Implementation details and references can be found in the included 'paper' PDF.