macminix
/

voicedesign_t0

	# voicedesign_t0

	A PromptTTS model fine-tuned from Qwen3-TTS-12Hz-1.7B-Base on additional training data. The changes relative to the base model are small.

	## Contents
	- `miner.py` — Vocence PromptTTS engine (`class Miner`: `__init__`, `warmup`, `generate_wav`)
	- `chute_config.yml` — Chutes build config (image, node selector, chute settings)
	- `vocence_config.yaml` — runtime options (sample_rate, adapter, limits)
	- `model.safetensors` — fine-tuned model weights
	- `speech_tokenizer/` — RVQ speech tokenizer
	- `tokenizer_config.json`, `vocab.json`, `merges.txt` — text tokenizer assets
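
The three method names listed for `class Miner` suggest a construct, warm up, generate call order. The sketch below illustrates only that order, using a stand-in class. Everything other than the method names is an assumption: the class name `SketchMiner`, the constructor arguments, the `generate_wav(instruction, text)` signature, the 24000 Hz default, and the WAV-bytes return type are hypothetical and are not taken from `miner.py` or `vocence_config.yaml`. The stub writes a short sine tone instead of loading the model.

```python
import io
import math
import struct
import wave


class SketchMiner:
    """Hypothetical stand-in that mirrors the method names listed for `class Miner`."""

    def __init__(self, sample_rate=24000, max_text_chars=500):
        # Assumed parameters standing in for the `sample_rate` and `limits`
        # runtime options; the real option names and values may differ.
        self.sample_rate = sample_rate
        self.max_text_chars = max_text_chars
        self.ready = False

    def warmup(self):
        # A real engine would presumably load weights or run a throwaway
        # generation here; this stub only flips a flag.
        self.ready = True

    def generate_wav(self, instruction, text):
        # Assumed signature: a voice description plus the text to speak,
        # returning a complete WAV file as bytes.
        if not self.ready:
            raise RuntimeError("call warmup() before generate_wav()")
        if len(text) > self.max_text_chars:
            raise ValueError("text exceeds the configured limit")
        # Placeholder audio: 1/20 s of a 440 Hz tone per character, not speech.
        n_frames = self.sample_rate // 20 * max(len(text), 1)
        frames = b"".join(
            struct.pack(
                "<h",
                int(12000 * math.sin(2 * math.pi * 440 * i / self.sample_rate)),
            )
            for i in range(n_frames)
        )
        buf = io.BytesIO()
        with wave.open(buf, "wb") as w:
            w.setnchannels(1)   # mono
            w.setsampwidth(2)   # 16-bit PCM
            w.setframerate(self.sample_rate)
            w.writeframes(frames)
        return buf.getvalue()


miner = SketchMiner()
miner.warmup()
wav_bytes = miner.generate_wav("a calm, low-pitched narrator", "Hello")
```

Returning bytes rather than writing to a path keeps the stub free of filesystem side effects; whether the actual `generate_wav` does the same is not stated in this README.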

	## Training notes
	This is a small incremental update over the base model, fine-tuned on a modest instruction-annotated dataset. It is intended for PromptTTS evaluation on the Vocence subnet (Bittensor SN78).