atri-sovits / README.md

Upload folder using huggingface_hub

40ade29 verified 14 days ago

1.72 kB

license: agpl-3.0
tags:
  - tts
  - text-to-speech
  - gpt-sovits
  - voice-clone
  - japanese
language:
  - ja

ATRI Voice Model — GPT-SoVITS v2Pro

WARNING: This model is for personal and research use only. Do not use it for commercial purposes or to impersonate real individuals.

A fine-tuned GPT-SoVITS v2Pro voice model for ATRI (from ATRI -My Dear Moments-), capable of synthesizing speech in Japanese, Chinese, and English.

Clone and set up GPT-SoVITS following its instructions.
Download the GPT pretrained model s1v3.ckpt from GPT-SoVITS (included in its pretrained models).
Place ATR_e8_s3952.pth and ref_audio.wav in your preferred location.
Update the paths in api_atri.py (replace /path/to/ placeholders with actual paths).
Run the API server:

cd /path/to/GPT-SoVITS
python api_atri.py

API docs will be available at http://127.0.0.1:9880/docs.

Endpoint	Method	Description
`/health`	GET	Health check
`/tts`	POST	Text-to-speech (returns full audio)
`/tts/stream`	POST	Streaming text-to-speech

This project is licensed under AGPL-3.0, consistent with GPT-SoVITS.