license: apache-2.0
tags:
- voice-changer
- rvc
- spin
- f0-categorized
- non-speech
- pretrained-model
datasets:
- custom
language:
- multilingual
KLM BlackTone Large (Pretrained Voice Changer Model)
KLM BlackTone Large is a pretrained model for real-time voice conversion, built using the Spin Embedder. It was trained on more than 11,000 hours of audio from 595 speakers, carefully categorized by F0 (fundamental frequency) range.
Requirements
To use this model without errors, you must use Applio v3.2.9 or later, specifically a build updated after July 15.
Even if your Applio version is labeled 3.2.9, a build updated before July 15 may still produce mismatch errors.
Key Features
- 595 F0-Categorized Speakers
- 11,000+ hours of curated training data
- Spin Embedder for flexible and generalizable voice transfer
- Real-time inference support
- Extensive support for non-verbal sounds such as:
  - Coughs
  - Laughter
  - Whispers
  - And other expressive human vocal behaviors
These features are made possible by including a large proportion of non-speech data in the training set.
Recommended Usage
To fully utilize BlackTone's capabilities, especially non-speech inference, you should include 10–20% non-speech data in your fine-tuning dataset.
If you do not have such data, you can try setting the Feature Index to 0, which may enable limited inference of non-verbal sounds. However, non-speech training data is highly recommended for best results.
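The 10–20% guideline can be checked before fine-tuning with a few lines of Python. The sketch below is illustrative only and is not part of Applio or this model: the `Clip` type, the function names, and the choice to measure the share by audio duration (rather than by clip count) are all assumptions, and the non-speech labels must come from your own annotation.

```python
from dataclasses import dataclass


@dataclass
class Clip:
    """One audio file in a fine-tuning dataset (hypothetical helper type)."""
    path: str
    seconds: float
    non_speech: bool  # label you assign yourself: cough, laugh, whisper, ...


def non_speech_share(clips):
    """Fraction of total duration that is labeled non-speech (assumed metric)."""
    total = sum(c.seconds for c in clips)
    if total <= 0:
        raise ValueError("dataset has no audio duration")
    return sum(c.seconds for c in clips if c.non_speech) / total


def in_recommended_range(share, low=0.10, high=0.20):
    """True if the share falls inside the 10-20% range from the model card."""
    return low <= share <= high


def non_speech_seconds_to_add(clips, target=0.10):
    """Seconds of extra non-speech audio needed to reach `target` share.

    Solves (ns + x) / (total + x) = target for x; returns 0.0 if the
    dataset already meets the target.
    """
    if not 0 <= target < 1:
        raise ValueError("target must be in [0, 1)")
    total = sum(c.seconds for c in clips)
    ns = sum(c.seconds for c in clips if c.non_speech)
    return max(0.0, (target * total - ns) / (1 - target))


if __name__ == "__main__":
    dataset = [
        Clip("speech_01.wav", 850.0, False),
        Clip("laughs_01.wav", 150.0, True),
    ]
    share = non_speech_share(dataset)
    print(f"non-speech share: {share:.1%}")
    print("within 10-20%:", in_recommended_range(share))
```

If the share comes out below 10%, `non_speech_seconds_to_add` gives a rough estimate of how much additional non-speech audio would bring the dataset up to the chosen target.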
License
Apache 2.0
Contact
For usage reports, contributions, or collaborations, please open an issue or contact the model maintainer.