| --- |
| license: agpl-3.0 |
| tags: |
| - tts |
| - text-to-speech |
| - gpt-sovits |
| - voice-clone |
| - japanese |
| language: |
| - ja |
| --- |
| |
| # ATRI Voice Model — GPT-SoVITS v2Pro |
|
|
| > **WARNING: This model is for personal and research use only. Do not use it for commercial purposes or to impersonate real individuals.** |
|
|
| --- |
|
|
| ### Overview |
|
|
| A fine-tuned [GPT-SoVITS](https://github.com/RVC-Boss/GPT-SoVITS) v2Pro voice model for ATRI (from *ATRI -My Dear Moments-*), capable of synthesizing speech in Japanese, Chinese, and English. |
|
|
| ### Files |
|
|
| - `ATR_e8_s3952.pth` — Fine-tuned SoVITS model weights (8 epochs, 3952 steps) |
| - `ref_audio.wav` — Reference audio for inference |
| - `api_atri.py` — FastAPI-based TTS inference server |
|
|
| ### Usage |
|
|
| 1. Clone and set up [GPT-SoVITS](https://github.com/RVC-Boss/GPT-SoVITS) following its instructions. |
| 2. Download the GPT pretrained model `s1v3.ckpt` from GPT-SoVITS (included in its pretrained models). |
| 3. Place `ATR_e8_s3952.pth` and `ref_audio.wav` in your preferred location. |
| 4. Update the paths in `api_atri.py` (replace `/path/to/` placeholders with actual paths). |
| 5. Run the API server: |
|
|
| ```bash |
| cd /path/to/GPT-SoVITS |
| python api_atri.py |
| ``` |
|
|
| API docs will be available at `http://127.0.0.1:9880/docs`. |
|
|
| ### API Endpoints |
|
|
| | Endpoint | Method | Description | |
| |---|---|---| |
| | `/health` | GET | Health check | |
| | `/tts` | POST | Text-to-speech (returns full audio) | |
| | `/tts/stream` | POST | Streaming text-to-speech | |
|
|
| ### Reference Audio |
|
|
| - **Text**: わたしはマスターの所有物ですので。 勝手に売買するのは違法です |
| - **Language**: Japanese |
|
|
| ### License |
|
|
| This project is licensed under [AGPL-3.0](LICENSE), consistent with [GPT-SoVITS](https://github.com/RVC-Boss/GPT-SoVITS). |
|
|