urtzai commited on
Commit
d7d0b80
·
verified ·
1 Parent(s): eb01125

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +60 -1
README.md CHANGED
@@ -10,4 +10,63 @@ tags:
10
  - speech-synthesis
11
  - Basque
12
  - piper
13
- ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
10
  - speech-synthesis
11
  - Basque
12
  - piper
13
+ ---
14
+
15
+ # Basque TTS: Maider (Piper Version)
16
+
17
+ This repository contains a [Piper](https://github.com/OHF-Voice/piper1-gpl) compatible version of the **Maider** Basque text-to-speech model. The original model was developed by **HiTZ Basque Center for Language Technology - Aholab Signal Processing Laboratory** (University of the Basque Country UPV/EHU).
18
+
19
+ This version has been exported/trained specifically for use with the Piper TTS engine, a fast, local neural text-to-speech engine.
20
+
21
+ ## Model Details
22
+
23
+ - **Language:** Basque (eu)
24
+ - **Speaker:** Maider (Female)
25
+ - **Architecture:** VITS (Optimized for Piper)
26
+ - **Original Credits:** HiTZ Center / Aholab (Project ILENIA)
27
+ - **Format:** Piper (`.onnx` and `.onnx.json` config)
28
+
29
+ ## Training Details
30
+
31
+ - **Architecture:** VITS
32
+ - **Training Engine:** Piper (PyTorch Lightning)
33
+ - **Iterations:** 22 epochs
34
+ - **Sample Rate:** 22050 Hz
35
+ - **Phonemization:** espeak-ng (Basque)
36
+
37
+ ## Files Included
38
+ * `eu-maider-medium.onnx`: The exported model for fast inference.
39
+ * `eu-maider-medium.onnx.json`: The configuration file (includes phoneme map and synthesis settings).
40
+ * `epoch=22-step=258750.ckpt`: The PyTorch Lightning checkpoint from the 22nd iteration (useful for further training/fine-tuning).
41
+
42
+ ## Usage
43
+
44
+ ### Using Piper CLI
45
+ You can run the model locally using the Piper binary:
46
+
47
+ ```bash
48
+ echo "Kaixo, hau Maider da, Piper motorra erabiliz euskaraz hitz egiten." | \
49
+ ./piper --model eu-maider-medium.onnx --output_file output.wav
50
+ ```
51
+ ### Python API
52
+
53
+ ```Python
54
+ from piper.voice import PiperVoice
55
+
56
+ voice = PiperVoice.load("eu-maider-medium.onnx", "eu-maider-medium.onnx.json")
57
+ with open("output.wav", "wb") as f:
58
+ voice.synthesize_wav("Gaur egun eguzkitsua dugu.", f)
59
+ ```
60
+
61
+ ## Original Model & Data Source
62
+
63
+ The base model belongs to the Aholab TTS collection. All voices in this collection are based on the VITS architecture proposed by Kim et al. (2021).
64
+
65
+ Maider & Antton: Developed by HiTZ with funding from Project ILENIA.
66
+
67
+ License: Public Creative Commons Attribution 4.0 (for the voice resource) and Apache License 2.0 (for the code/model).
68
+
69
+ ## Authors & Credits
70
+
71
+ The original Maider model was created by:
72
+ HiTZ Basque Center for Language Technology - Aholab Signal Processing Laboratory, University of the Basque Country EHU.