Emmylahot12/TextToCloneSpeech

speech synthesis

voice generation

Emmylahot12 committed on May 8, 2025

Commit 245bfbe · verified · 1 Parent(s): ccc2c0e

Update README.md

Files changed (1)

README.md +44 -34

@@ -1,34 +1,44 @@
----
-license: apache-2.0
-tags:
-  - text-to-speech
-  - TTS
-  - audio
-  - speech
-  - xtts
-language:
-  - en
-datasets:
-  - Emmylahot12/nnamdi
-model-index:
-  - name: nnamdi-tts
-    results: []
----
-# Nnamdi TTS Model
-This model is a text-to-speech (TTS) voice synthesis model using Coqui XTTS v2, built using the voice of the user from the `nnamdi` dataset.
-## How to Use
-```python
-from TTS.api import TTS
-tts = TTS(model_name="tts_models/multilingual/multi-dataset/xtts_v2", gpu=False)
-tts.tts_to_file(
-    text="Hello, this is a custom voice!",
-    speaker_wav="yourvoice.wav",
-    language="en",
-    file_path="output.wav"
-)

+# CloneTTS - Text-to-Speech Model
+CloneTTS is a text-to-speech (TTS) model trained on the **Clone** dataset, which pairs audio files with their transcriptions. The model converts text input into natural-sounding speech.
+## License
+This model is licensed under the [Creative Commons Attribution 4.0 International License (CC BY 4.0)](https://creativecommons.org/licenses/by/4.0/). You are free to share, adapt, and use the model for any purpose, including commercial uses, as long as appropriate credit is given.
+## Model Overview
+- **Input**: Text data.
+- **Output**: `.wav` audio files (speech).
+- **Task**: Text-to-speech (TTS) conversion.
+### Features
+- Convert text to high-quality, natural-sounding speech.
+- Trained using the Clone dataset, designed to improve the quality of generated speech.
+## Dataset Overview
+This model is trained on the **Clone** dataset, which consists of:
+- **Audio files**: `.wav` format.
+- **Transcriptions**: Corresponding text transcriptions for each audio file.
+- **Format**: A CSV file that pairs audio file paths with their corresponding text.
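The README describes the CSV only as pairing audio file paths with text, without naming its columns or delimiter. As a minimal sketch of reading such a file, the example below assumes a comma-separated file with a header row and hypothetical column names `path` and `text`; the helper `load_pairs` and the sample rows are illustrative and do not come from the repository.

```python
import csv
import io


def load_pairs(csv_file):
    """Read (audio_path, transcription) pairs from an open CSV file.

    Assumes a header row with hypothetical columns `path` and `text`;
    the README does not specify the real column names.
    """
    reader = csv.DictReader(csv_file)
    return [(row["path"], row["text"].strip()) for row in reader]


# In-memory stand-in for data/transcriptions.csv (contents are made up).
sample = io.StringIO(
    "path,text\n"
    "data/0001.wav,Hello there.\n"
    'data/0002.wav,"Commas, inside quotes, are preserved."\n'
)

pairs = load_pairs(sample)
for audio_path, text in pairs:
    print(audio_path, "->", text)
```

To read a real file, pass `open("data/transcriptions.csv", newline="", encoding="utf-8")` instead of the in-memory sample, adjusting the column names to match the actual header.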
+### File Structure
+- `data/`: Contains the audio files and the `transcriptions.csv` file used to train the model.
+- `model/`: Contains the trained model files, including `model_weights.h5` and `model_config.json`.
+- `notebooks/`: Contains Jupyter notebooks for experimenting with the model and performing inference.
+- `requirements.txt`: A list of required libraries and dependencies for running the model.
+- `train.py`: Script to train the model on your dataset.
+## Installation
+To use this model, follow the instructions below to clone the repository and install dependencies.
+1. Clone the repository:
+```bash
+git clone https://github.com/your_username/CloneTTS.git
+cd CloneTTS