File size: 1,721 Bytes
40ade29
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
---
license: agpl-3.0
tags:
- tts
- text-to-speech
- gpt-sovits
- voice-clone
- japanese
language:
- ja
---

# ATRI Voice Model — GPT-SoVITS v2Pro

> **WARNING: This model is for personal and research use only. Do not use it for commercial purposes or to impersonate real individuals.**

---

### Overview

A fine-tuned [GPT-SoVITS](https://github.com/RVC-Boss/GPT-SoVITS) v2Pro voice model for ATRI (from *ATRI -My Dear Moments-*), capable of synthesizing speech in Japanese, Chinese, and English.

### Files

- `ATR_e8_s3952.pth` — Fine-tuned SoVITS model weights (8 epochs, 3952 steps)
- `ref_audio.wav` — Reference audio for inference
- `api_atri.py` — FastAPI-based TTS inference server

### Usage

1. Clone and set up [GPT-SoVITS](https://github.com/RVC-Boss/GPT-SoVITS) following its instructions.
2. Download the GPT pretrained model `s1v3.ckpt` from GPT-SoVITS (included in its pretrained models).
3. Place `ATR_e8_s3952.pth` and `ref_audio.wav` in your preferred location.
4. Update the paths in `api_atri.py` (replace `/path/to/` placeholders with actual paths).
5. Run the API server:

```bash
cd /path/to/GPT-SoVITS
python api_atri.py
```

API docs will be available at `http://127.0.0.1:9880/docs`.

### API Endpoints

| Endpoint | Method | Description |
|---|---|---|
| `/health` | GET | Health check |
| `/tts` | POST | Text-to-speech (returns full audio) |
| `/tts/stream` | POST | Streaming text-to-speech |

### Reference Audio

- **Text**: わたしはマスターの所有物ですので。 勝手に売買するのは違法です
- **Language**: Japanese

### License

This project is licensed under [AGPL-3.0](LICENSE), consistent with [GPT-SoVITS](https://github.com/RVC-Boss/GPT-SoVITS).