| # voicedesign_t0 |
| |
| PromptTTS model based on **Qwen3-TTS-12Hz-1.7B-Base**, with small updates from fine-tuning on additional training data. |
| |
| ## Contents |
| - `miner.py` β Vocence PromptTTS engine (`class Miner`: `__init__`, `warmup`, `generate_wav`) |
| - `chute_config.yml` β Chutes build config (image, node selector, chute settings) |
| - `vocence_config.yaml` β runtime options (sample_rate, adapter, limits) |
| - `model.safetensors` β fine-tuned model weights |
| - `speech_tokenizer/` β RVQ speech tokenizer |
| - `tokenizer_config.json`, `vocab.json`, `merges.txt` β text tokenizer assets |
|
|
| ## Training notes |
| Small incremental update over the base model, fine-tuned on a modest instruction-annotated dataset. Intended for Vocence subnet (Bittensor SN78) PromptTTS evaluation. |
|
|