Commit 245bfbe · verified · 1 parent: ccc2c0e · committed by Emmylahot12

Update README.md

Files changed (1)
  1. README.md +44 -34
README.md CHANGED
@@ -1,34 +1,44 @@
- ---
- license: apache-2.0
- tags:
- - text-to-speech
- - TTS
- - audio
- - speech
- - xtts
- language:
- - en
- datasets:
- - Emmylahot12/nnamdi
- model-index:
- - name: nnamdi-tts
- results: []
- ---
-
- # Nnamdi TTS Model
-
- This model is a text-to-speech (TTS) voice synthesis model using Coqui XTTS v2, built using the voice of the user from the `nnamdi` dataset.
-
- ## How to Use
-
- ```python
- from TTS.api import TTS
-
- tts = TTS(model_name="tts_models/multilingual/multi-dataset/xtts_v2", gpu=False)
-
- tts.tts_to_file(
- text="Hello, this is a custom voice!",
- speaker_wav="yourvoice.wav",
- language="en",
- file_path="output.wav"
- )
+ # CloneTTS - Text-to-Speech Model
+
+ CloneTTS is a Text-to-Speech (TTS) model trained on the **Clone** dataset. The model converts text input into natural-sounding speech and is built to facilitate speech synthesis tasks. It uses the Clone dataset for training, which includes transcriptions and corresponding audio files.
+
+ ## License
+
+ This model is licensed under the [Creative Commons Attribution 4.0 International License (CC BY 4.0)](https://creativecommons.org/licenses/by/4.0/). You are free to share, adapt, and use the model for any purpose, including commercial uses, as long as appropriate credit is given.
+
+ ## Model Overview
+
+ - **Input**: Text data.
+ - **Output**: `.wav` audio files (speech).
+ - **Task**: Text-to-speech (TTS) conversion.
+
+ ### Features
+
+ - Convert text to high-quality, natural-sounding speech.
+ - Trained using the Clone dataset, designed to improve the quality of generated speech.
+
+ ## Dataset Overview
+
+ This model is trained on the **Clone** dataset, which consists of:
+
+ - **Audio files**: `.wav` format.
+ - **Transcriptions**: Corresponding text transcriptions for each audio file.
+ - **Format**: A CSV file that pairs audio file paths with their corresponding text.
+
+ ### File Structure
+
+ - `data/`: Contains the audio files and the `transcriptions.csv` file used to train the model.
+ - `model/`: Contains the trained model files, including `model_weights.h5` and `model_config.json`.
+ - `notebooks/`: Contains Jupyter notebooks for experimenting with the model and performing inference.
+ - `requirements.txt`: A list of required libraries and dependencies for running the model.
+ - `train.py`: Script to train the model on your dataset.
+
+ ## Installation
+
+ To use this model, follow the instructions below to clone the repository and install dependencies.
+
+ 1. Clone the repository:
+
+ ```bash
+ git clone https://github.com/your_username/CloneTTS.git
+ cd CloneTTS