Spaces:

CherithCutestory
/

vlengine-xttsv2

Paused

App Files Files Community

vlengine-xttsv2 / README.md

CherithCutestory

Adding new engine files

8962735 3 months ago

preview code

raw

history blame contribute delete

1.41 kB

metadata

title: VoxLibris XTTSv2 Engine
emoji: 🔊
colorFrom: purple
colorTo: indigo
sdk: docker
app_port: 7860
pinned: false

VoxLibris XTTSv2 TTS Engine

A HuggingFace Space that serves the Coqui XTTSv2 model as a REST API, implementing the VoxLibris TTS Engine API Contract.

Endpoints

POST /GetEngineDetails

Returns engine capabilities, supported emotions, and available languages.

POST /ConvertTextToSpeech

Converts text to speech. Supports voice cloning via base64-encoded WAV samples.

GET /health

Returns model loading status.

Authentication

Set the API_KEY secret in your HuggingFace Space settings. Requests must include Authorization: Bearer <your-key> header. Leave API_KEY unset to disable authentication.

Voice Cloning

XTTSv2 supports voice cloning. Send a base64-encoded WAV file in the voice_to_clone_sample field of the ConvertTextToSpeech request. A 6-15 second clear speech sample works best.

Supported Languages

en, es, fr, de, it, pt, pl, tr, ru, nl, cs, ar, zh-cn, ja, hu, ko

Deployment

Create a new HuggingFace Space with Docker SDK
Upload the contents of this folder
Set the API_KEY secret in Space settings (optional)
The model downloads automatically on first startup (~1.8 GB)
Register the Space URL in VoxLibris Settings under TTS Engine Management