# voicedesign_t0
PromptTTS model based on **Qwen3-TTS-12Hz-1.7B-Base**, with small updates from fine-tuning on additional training data.
## Contents
- `miner.py`: Vocence PromptTTS engine (`class Miner`: `__init__`, `warmup`, `generate_wav`)
- `chute_config.yml`: Chutes build config (image, node selector, chute settings)
- `vocence_config.yaml`: runtime options (sample_rate, adapter, limits)
- `model.safetensors`: fine-tuned model weights
- `speech_tokenizer/`: RVQ speech tokenizer
- `tokenizer_config.json`, `vocab.json`, `merges.txt`: text tokenizer assets
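
To make the shape of the `Miner` interface concrete, here is a minimal stand-in sketch. Only the class name and the three method names come from this repo; the argument names (`text`, `instruction`), the `bytes` return type, and the default `sample_rate` of 24000 are assumptions for illustration, and the body writes silence rather than running the model. See `miner.py` for the real signatures.

```python
import io
import wave


class Miner:
    """Hypothetical stand-in for the engine in miner.py; signatures are assumed."""

    def __init__(self, sample_rate: int = 24000):
        # Assumed default; the real value would come from vocence_config.yaml.
        self.sample_rate = sample_rate
        self.ready = False

    def warmup(self) -> None:
        # A real engine would load weights and run a dummy inference here.
        self.ready = True

    def generate_wav(self, text: str, instruction: str) -> bytes:
        # Placeholder: emits one second of 16-bit mono silence, not speech.
        if not self.ready:
            raise RuntimeError("call warmup() before generate_wav()")
        buf = io.BytesIO()
        with wave.open(buf, "wb") as w:
            w.setnchannels(1)
            w.setsampwidth(2)
            w.setframerate(self.sample_rate)
            w.writeframes(b"\x00\x00" * self.sample_rate)
        return buf.getvalue()


miner = Miner()
miner.warmup()
wav_bytes = miner.generate_wav("Hello there.", "A calm, low-pitched voice.")
```

The sketch keeps `warmup` separate from `__init__` so that slow model loading can happen once before the first request, which is presumably why the real class exposes it as its own method.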
## Training notes
A small incremental update over the base model, fine-tuned on a modest instruction-annotated dataset. Intended for PromptTTS evaluation on the Vocence subnet (Bittensor SN78).