ATRI Voice Model - GPT-SoVITS v2Pro
WARNING: This model is for personal and research use only. Do not use it for commercial purposes or to impersonate real individuals.
Overview
A fine-tuned GPT-SoVITS v2Pro voice model for ATRI (from ATRI -My Dear Moments-), capable of synthesizing speech in Japanese, Chinese, and English.
Files
- ATR_e8_s3952.pth: Fine-tuned SoVITS model weights (8 epochs, 3952 steps)
- ref_audio.wav: Reference audio for inference
- api_atri.py: FastAPI-based TTS inference server
Usage
- Clone and set up GPT-SoVITS following its instructions.
- Download the GPT pretrained model s1v3.ckpt from GPT-SoVITS (included in its pretrained models).
- Place ATR_e8_s3952.pth and ref_audio.wav in your preferred location.
- Update the paths in api_atri.py (replace the /path/to/ placeholders with actual paths).
- Run the API server:
cd /path/to/GPT-SoVITS
python api_atri.py
API docs will be available at http://127.0.0.1:9880/docs.
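Before starting the server, it can help to confirm that the files from the steps above are in place and that no /path/to/ placeholder is left in api_atri.py. The following is a minimal sketch, not part of this repository: the `models/` locations and the helper names `find_placeholder_lines` and `missing_files` are hypothetical and only illustrate the check.

```python
from pathlib import Path

PLACEHOLDER = "/path/to/"  # placeholder prefix mentioned in the setup steps


def find_placeholder_lines(script_text: str) -> list[int]:
    """Return 1-based line numbers that still contain the placeholder prefix."""
    return [
        lineno
        for lineno, line in enumerate(script_text.splitlines(), start=1)
        if PLACEHOLDER in line
    ]


def missing_files(paths: list[str]) -> list[str]:
    """Return the entries of `paths` that do not exist as regular files."""
    return [p for p in paths if not Path(p).is_file()]


if __name__ == "__main__":
    # Hypothetical locations; substitute wherever you actually placed the files.
    required = [
        "models/ATR_e8_s3952.pth",
        "models/ref_audio.wav",
        "models/s1v3.ckpt",
    ]
    for path in missing_files(required):
        print(f"missing: {path}")

    script = Path("api_atri.py")
    if script.is_file():
        text = script.read_text(encoding="utf-8")
        for lineno in find_placeholder_lines(text):
            print(f"api_atri.py line {lineno} still contains {PLACEHOLDER}")
```

If the script prints nothing, all listed files exist and no placeholder remains in api_atri.py (when that file is present in the current directory).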
API Endpoints
| Endpoint | Method | Description |
|---|---|---|
| /health | GET | Health check |
| /tts | POST | Text-to-speech (returns full audio) |
| /tts/stream | POST | Streaming text-to-speech |
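A client for these endpoints might look like the sketch below, which uses only the Python standard library. The request schema of api_atri.py is not documented here, so the JSON field names `text` and `text_lang`, the language code `"en"`, and the assumption that the response body is WAV audio are all guesses; check the generated docs at http://127.0.0.1:9880/docs for the actual fields before relying on it.

```python
import json
import urllib.request

BASE_URL = "http://127.0.0.1:9880"  # address given in the Usage section


def build_tts_request(text: str, text_lang: str, stream: bool = False) -> urllib.request.Request:
    """Build a POST request for /tts or /tts/stream.

    The JSON field names are assumptions; confirm them at /docs.
    """
    endpoint = "/tts/stream" if stream else "/tts"
    payload = json.dumps({"text": text, "text_lang": text_lang}).encode("utf-8")
    return urllib.request.Request(
        BASE_URL + endpoint,
        data=payload,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def synthesize_to_file(text: str, text_lang: str, out_path: str, stream: bool = False) -> None:
    """Send the request and write the response body to out_path in chunks."""
    request = build_tts_request(text, text_lang, stream)
    with urllib.request.urlopen(request) as resp, open(out_path, "wb") as out:
        while chunk := resp.read(8192):
            out.write(chunk)


if __name__ == "__main__":
    # Requires the server from the Usage section to be running.
    with urllib.request.urlopen(BASE_URL + "/health") as resp:
        print("health status:", resp.status)
    synthesize_to_file("Hello.", "en", "out.wav")  # output format assumed to be WAV
```

Reading the response in chunks works for both endpoints: with /tts the loop simply drains the full audio, while with /tts/stream it writes data as it arrives.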
Reference Audio
- Text: わたしはマスターの所有物ですので、勝手に売買するのは違法です
- Language: Japanese
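If you swap in your own reference clip, its length matters: GPT-SoVITS generally expects reference audio of roughly 3 to 10 seconds. The sketch below checks a clip's duration with the standard `wave` module. The helper names are hypothetical, the bounds are defaults you may need to adjust for your GPT-SoVITS version, and `wave` only reads uncompressed PCM WAV files.

```python
import wave


def wav_duration_seconds(path: str) -> float:
    """Return the duration of an uncompressed PCM WAV file in seconds."""
    with wave.open(path, "rb") as wav:
        return wav.getnframes() / wav.getframerate()


def is_usable_reference(path: str, min_s: float = 3.0, max_s: float = 10.0) -> bool:
    """Check that the clip length falls inside the assumed 3 to 10 second window."""
    return min_s <= wav_duration_seconds(path) <= max_s


if __name__ == "__main__":
    # Adjust the path to wherever you placed the reference audio.
    clip = "ref_audio.wav"
    print(f"{clip}: {wav_duration_seconds(clip):.2f} s, usable: {is_usable_reference(clip)}")
```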
License
This project is licensed under AGPL-3.0, consistent with GPT-SoVITS.