metadata
license: apache-2.0
tags:
  - voice-changer
  - rvc
  - spin
  - f0-categorized
  - non-speech
  - pretrained-model
datasets:
  - custom
language:
  - multilingual

🔊 KLM BlackTone Large (Pretrained Voice Changer Model)

KLM BlackTone Large is a pretrained model for real-time voice conversion, built using the Spin Embedder. It was trained on more than 11,000 hours of audio from 595 speakers, categorized by F0 (fundamental frequency) range.


🚀 Requirements

To use this model without errors, you must use Applio v3.2.9 or later, and specifically a build updated after July 15.
Even if your Applio version is labeled 3.2.9, an installation last updated before July 15 may still produce mismatch errors.


🌟 Key Features

  • ✅ 595 F0-categorized speakers
  • ✅ 11,000+ hours of curated training data
  • ✅ Spin Embedder for flexible and generalizable voice transfer
  • ✅ Real-time inference support
  • ✅ Extensive support for non-verbal sounds such as:
    • Coughs
    • Laughter
    • Whispers
    • And other expressive human vocal behaviors

These features are made possible by including a large proportion of non-speech data in the training set.


📊 Recommended Usage

To fully utilize BlackTone's capabilities, especially non-speech inference, you should include 10–20% non-speech data in your fine-tuning dataset.

If you do not have such data, you can try setting the Feature Index to 0, which may enable limited inference of non-verbal sounds. However, non-speech training data is highly recommended for best results.
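
The 10–20% guideline can be checked with simple arithmetic before fine-tuning. The sketch below is only illustrative: it assumes the share is measured by audio duration (the model card does not say whether the percentage refers to duration or clip count), and the function names are hypothetical helpers, not part of Applio.

```python
def non_speech_share(speech_seconds: float, non_speech_seconds: float) -> float:
    """Fraction of the total dataset duration that is non-speech audio."""
    total = speech_seconds + non_speech_seconds
    if total <= 0:
        raise ValueError("dataset is empty")
    return non_speech_seconds / total


def non_speech_needed(speech_seconds: float, target_share: float) -> float:
    """Seconds of non-speech audio needed to reach target_share of the total.

    Solves n / (s + n) = target_share for n.
    """
    if not 0 <= target_share < 1:
        raise ValueError("target_share must be in [0, 1)")
    return speech_seconds * target_share / (1 - target_share)


def in_recommended_range(share: float, low: float = 0.10, high: float = 0.20) -> bool:
    """True if the share falls inside the 10-20% range (assumed duration-based)."""
    return low <= share <= high


if __name__ == "__main__":
    speech = 45 * 60      # 45 minutes of speech, in seconds
    non_speech = 5 * 60   # 5 minutes of coughs, laughter, whispers, etc.
    share = non_speech_share(speech, non_speech)
    print(f"non-speech share: {share:.1%}")
    print("within recommended range:", in_recommended_range(share))
```

For a dataset with a fixed amount of speech, `non_speech_needed(speech, 0.10)` and `non_speech_needed(speech, 0.20)` give the lower and upper bounds on how much non-speech audio to add under the same duration-based assumption.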


📎 License

Apache 2.0


📫 Contact

For usage reports, contributions, or collaborations, please open an issue or contact the model maintainer.