ckpt-800 β€” merged

Self-contained snapshot of magma90909/vocence_miner_v3 with the ckpt-800 LoRA adapter folded into the Talker weights. No PEFT layers at runtime β€” load directly with Qwen3TTSModel.from_pretrained.

Quick start

from qwen_tts import Qwen3TTSModel

wrap = Qwen3TTSModel.from_pretrained("ratrys/sft-tts-800")
wavs, sr = wrap.generate_voice_design(
    text="Hello, this is a test.",
    instruct="A young adult female speaker speaks calmly at a normal pace.",
    language="english",
    temperature=0.9, top_p=1.0, top_k=50,
    repetition_penalty=1.05, max_new_tokens=600,
)

Repository layout

Same shape as the upstream macminix repo:

  • config.json, model.safetensors β€” Qwen3-TTS Talker + Code Predictor (merged)
  • speech_tokenizer/ β€” 12.5 fps Γ— 16 codebook neural codec (unchanged)
  • tokenizer.*, vocab.json, merges.txt, added_tokens.json, special_tokens_map.json β€” Qwen2 BPE tokenizer
  • generation_config.json, preprocessor_config.json
  • vocence_config.yaml, chute_config.yml β€” runtime + Chutes deploy hints

You can drop your own miner.py into this repo (same contract as macminix's: class Miner with __init__(path_hf_repo: Path), warmup(), generate_wav(instruction, text) β†’ (np.ndarray, int)); the standard Vocence chute wrapper will load this model unchanged.

Provenance

See merge_info.json for the exact base path, adapter path, LoRA hyperparameters, and merge timestamp.

Downloads last month
31
Safetensors
Model size
2B params
Tensor type
BF16
Β·
Inference Providers NEW
This model isn't deployed by any Inference Provider. πŸ™‹ Ask for provider support

Model tree for matthewliu0302/sft-tts-2000

Finetuned
(4)
this model