ZLSCompLing commited on
Commit
596eab1
·
verified ·
1 Parent(s): c327128

Upload README.md with huggingface_hub

Browse files
Files changed (1) hide show
  1. README.md +102 -3
README.md CHANGED
@@ -1,3 +1,102 @@
1
- ---
2
- license: mit
3
- ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: cc-by-nc-4.0
3
+ language:
4
+ - lb
5
+ tags:
6
+ - text-to-speech
7
+ - tts
8
+ - vits
9
+ - coqui
10
+ - luxembourgish
11
+ library_name: coqui
12
+ pipeline_tag: text-to-speech
13
+ ---
14
+
15
+ # Coqui TTS - Max (Luxembourgish Male Voice)
16
+
17
+ A VITS-based text-to-speech model for Luxembourgish, featuring a natural male voice.
18
+
19
+ ## Model Description
20
+
21
+ This model was trained using the [Coqui TTS](https://github.com/coqui-ai/TTS) framework on Luxembourgish speech data from the [Lëtzebuerger Online Dictionnaire (LOD)](https://lod.lu) example sentences.
22
+
23
+ "Max" is a male Luxembourgish voice based on recordings from a real speaker.
24
+
25
+ ### Model Details
26
+
27
+ - **Architecture:** VITS
28
+ - **Language:** Luxembourgish (lb)
29
+ - **Speaker:** Single speaker (male)
30
+ - **Sample Rate:** 22050 Hz
31
+ - **License:** CC BY-NC 4.0 (Non-commercial use only)
32
+
33
+ ## License Notice
34
+
35
+ **This model is for non-commercial use only.** All commercial uses are prohibited. The voice data is derived from recordings of a real speaker and may only be used freely for non-commercial purposes.
36
+
37
+ ## Usage
38
+
39
+ ```python
40
+ import torch
41
+ import scipy.io.wavfile as wavfile
42
+ from TTS.utils.synthesizer import Synthesizer
43
+
44
+ # Load the model
45
+ synthesizer = Synthesizer(
46
+ tts_checkpoint="path/to/coqui-tts-max.pth",
47
+ tts_config_path="path/to/config.json",
48
+ use_cuda=torch.cuda.is_available()
49
+ )
50
+
51
+ # Generate speech
52
+ wav = synthesizer.tts("Moien, wéi geet et dir?")
53
+
54
+ # Save to file
55
+ wavfile.write("output.wav", 22050, wav)
56
+ ```
57
+
58
+ ### Example Sentences
59
+
60
+ The model was validated on sentences including:
61
+
62
+ - "An der Zäit hunn sech den Nordwand an d'Sonn gestridden, wie vun hinnen zwee wuel méi staark wier."
63
+ - "Wéi e Wanderer, deen an ee waarme Mantel agepak war, iwwert de Wee koum."
64
+
65
+ ## Technical Specifications
66
+
67
+ | Parameter | Value |
68
+ |-----------|-------|
69
+ | Hidden Channels | 192 |
70
+ | Text Encoder Layers | 6 |
71
+ | Posterior Encoder Layers | 16 |
72
+ | Flow Layers | 4 |
73
+ | Mel Channels | 80 |
74
+ | FFT Size | 1024 |
75
+
76
+ ## Citation
77
+
78
+ If you use this model, please cite:
79
+
80
+ ```bibtex
81
+ @misc{zls2025coquimax,
82
+ title={Coqui TTS Max - Luxembourgish Male Voice},
83
+ author={Zenter fir d'Lëtzebuerger Sprooch},
84
+ year={2025},
85
+ publisher={Hugging Face},
86
+ url={https://huggingface.co/ZLSCompLing/CoquiTTS-Max}
87
+ }
88
+ ```
89
+
90
+ ## Acknowledgments
91
+
92
+ Originally trained by [Marco Barnig](https://huggingface.co/mbarnig). Now developed and maintained by [Zenter fir d'Lëtzebuerger Sprooch](https://zls.lu).
93
+
94
+ Voice data sourced from the [Lëtzebuerger Online Dictionnaire (LOD)](https://lod.lu). The original audio files are available via the [LOD linguistic data on data.public.lu](https://data.public.lu/en/datasets/letzebuerger-online-dictionnaire-lod-linguistesch-daten/), which provides an XML file containing example sentence IDs. Audio files can be accessed at:
95
+
96
+ ```
97
+ https://lod.lu/uploads/examples/AAC/{folder}/{id}.m4a
98
+ ```
99
+
100
+ where `{folder}` is the first 2 characters of `{id}`.
101
+
102
+ This model is used in [Sproochmaschinn](https://sproochmaschinn.lu), a Luxembourgish speech processing platform.