monkey_01 / README.md
Gem1832's picture
Upload folder using huggingface_hub
543e56d verified

shiningstar1128/monkey_01

QWEN3Vox voice TTS for the Vocence subnet.

Hugging Face repo id

model_name in vocence_config.yaml, on-chain commit, and chute wrapper VOCENCE_REPO must all be:

shiningstar1128/monkey_01

Validator /speak contract

Validators POST JSON (natural-language prompts after source extraction):

{
  "text": "<transcription to synthesize>",
  "instruction": "An adult female with an American accent, speaking at a normal pace in a mid-range pitch, sounding neutral and formal throughout"
}
  • text — passed verbatim as the synthesis script (length cap only).
  • instruction — passed verbatim to the processor system_prompt (length cap only).

How this engine uses them:

  • instruction → processor system_prompt (no trait parsing or paraphrasing).
  • text → script body (plain transcript or existing Speaker N: lines).
  • Default aux audio shard for acoustic prefill only.

Repo layout

File Purpose
miner.py QWEN3Vox engine (Miner class)
vocence_config.yaml model_name + runtime/generation limits
chute_config.yml Chutes image, GPU (64 GB VRAM), pip deps
model-0000*-of-00003.safetensors Sharded weights (≥ 50 MiB total)
aux_lm_residual_projection.safetensors Aux conditioning shard
tokenizer.json, tokenizer_config.json, vocab.json, merges.txt Tokenizer

Deploy

  1. Upload this folder to shiningstar1128/monkey_01 and pin a 40-char commit SHA.
  2. Render/deploy chute with VOCENCE_REPO=shiningstar1128/monkey_01.
  3. Commit: uv run vocence miner commit --model-name shiningstar1128/monkey_01 --model-revision <sha> --chute-id <uuid>.

See miner_sample/MINER_GUIDE.md for anti-cheat rules.