---
title: Kokoro TTS
emoji: 
colorFrom: purple
colorTo: purple
sdk: gradio
sdk_version: 6.5.1
app_file: app.py
python_version: 3.11
pinned: false
---

# Kokoro TTS & Voice Cloning System

A complete solution for high-quality Text-to-Speech using the Kokoro-82M model.

## Features
- **All Pre-built Voices**: Includes American, British, and International voices (Spanish, French, Italian, Japanese, Portuguese).
- **Long Script Processing**: Automatically splits long texts into chunks and merges the resulting audio, so the output is not cut off.
- **Voice Cloning Interface**: Upload a reference to "clone" voices (supports loading custom `.pt` voice vectors).
- **Gradio Web UI**: Simple and intuitive interface for all functions.

## Installation
1. Make sure Python is installed and on your PATH.
2. Run `setup.bat` to install all dependencies.

## How to Run
1. Run `run.bat`.
2. Open the URL provided in the terminal (e.g., `http://127.0.0.1:7860`).

## Voices Included
- **American**: `af_heart`, `af_bella`, `af_nicole`, `af_sarah`, `af_sky`, `am_adam`, `am_michael`, `am_fenrir`, `am_puck`.
- **British**: `bf_emma`, `bf_isabella`, `bm_george`, `bm_lewis`.
- **International**: `jo_zeus` (JA), `es_barcelona` (ES), `fr_paris` (FR), `it_florence` (IT), and more.

## Long Scripts
Enable "Enable Long Script Processing" for large texts. The text is split into chunks, each chunk is processed individually, and the results are merged into a single continuous audio file.
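
The split, process, and merge flow described above can be sketched roughly as follows. This is an illustrative sketch only, not the code in `app.py`: the function names, the `max_chars` limit, the sentence-boundary splitting rule, and the 24 kHz sample rate are all assumptions, and `synthesize` stands in for whatever function turns one chunk of text into audio samples.

```python
import re
from typing import Callable


def split_into_chunks(text: str, max_chars: int = 300) -> list[str]:
    """Group sentences into chunks of at most max_chars characters.

    Assumption: splitting happens at sentence-ending punctuation. A single
    sentence longer than max_chars is kept whole as its own chunk.
    """
    sentences = re.split(r"(?<=[.!?])\s+", text.strip())
    chunks: list[str] = []
    current = ""
    for sentence in sentences:
        if not sentence:
            continue
        candidate = f"{current} {sentence}".strip()
        if current and len(candidate) > max_chars:
            # Adding this sentence would overflow the chunk, so start a new one.
            chunks.append(current)
            current = sentence
        else:
            current = candidate
    if current:
        chunks.append(current)
    return chunks


def merge_audio(
    segments: list[list[float]],
    sample_rate: int = 24000,  # assumed output rate
    gap_seconds: float = 0.0,
) -> list[float]:
    """Concatenate audio segments, optionally inserting silence between them."""
    gap = [0.0] * int(sample_rate * gap_seconds)
    merged: list[float] = []
    for index, segment in enumerate(segments):
        if index > 0:
            merged.extend(gap)
        merged.extend(segment)
    return merged


def process_long_script(
    text: str,
    synthesize: Callable[[str], list[float]],  # hypothetical per-chunk TTS call
    max_chars: int = 300,
) -> list[float]:
    """Split the text, synthesize each chunk, and merge the audio."""
    segments = [synthesize(chunk) for chunk in split_into_chunks(text, max_chars)]
    return merge_audio(segments)
```

Splitting at sentence boundaries rather than at a fixed character offset keeps each chunk a natural unit of speech, which tends to avoid audible breaks in the middle of a phrase.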


Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference