Thorsten-Voice – Orpheus TTS v2 (Mini Fine-Tuned)

Overview

Thorsten-Voice/tv-orpheus-v2 is an improved version of tv-orpheus-v1, further optimized to better match the natural speaking style of the original speaker.

It was fine-tuned using a small, carefully curated mini dataset (60 recordings, TV-24kHz-2025.12-Neutral-FT-Mini) recorded in everyday speech situations, focusing on:

  • calm explanations
  • natural pauses
  • neutral statements
  • questions
  • short and long sentences

Training date / codebase reference: December 2025


Training Data

Base Training (v1)

  • ~12,000 recordings
  • Thorsten-Voice Dataset 2022.10
  • Sample rate: 24 kHz
  • License: CC0

Additional Fine-Tuning (v2)

  • Dataset: Thorsten-Voice/TV-24kHz-2025.12-Neutral-FT-Mini
  • Number of recordings: 60
  • Sample rate: 24 kHz
  • Content: neutral, natural everyday speech
  • License: CC0 (Public Domain)

The mini dataset was designed to pull the synthesized voice closer to the real speaker, improving authenticity and reducing synthetic artifacts.
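
Both datasets listed above use a 24 kHz sample rate, so it can be useful to confirm that a recording or a generated file matches that rate before comparing or mixing audio. The following is a minimal sketch using Python's standard wave module. It assumes uncompressed PCM WAV files, which this card does not state explicitly, and the helper names sample_rate and is_expected_rate are hypothetical.

```python
import wave

# Sample rate stated for both the v1 and v2 training data in this card.
EXPECTED_RATE_HZ = 24_000


def sample_rate(path):
    """Return the sample rate (Hz) stored in a PCM WAV file header."""
    with wave.open(str(path), "rb") as wav_file:
        return wav_file.getframerate()


def is_expected_rate(path, expected=EXPECTED_RATE_HZ):
    """True if the file's sample rate equals the expected rate."""
    return sample_rate(path) == expected
```

For example, is_expected_rate("output.wav") could be called on a file produced by the inference script to check whether it is at 24 kHz.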


Model Lineage

  • Base model: canopylabs/3b-de-ft-research_release
  • Intermediate model: Thorsten-Voice/tv-orpheus-v1
  • Final fine-tuning: 60-sample neutral mini dataset
  • TTS framework: Orpheus TTS

License Clarification

  • Model weights:
    Licensed under Apache-2.0, inherited from the Orpheus base model.

  • Training data & voice:
    Released under CC0 (Public Domain) by the speaker.

No additional usage restrictions are imposed.


Usage Example

python inference_hf.py \
  --model_path Thorsten-Voice/tv-orpheus-v2 \
  --text "Für mich sind alle Menschen gleich, unabhängig von Herkunft oder Religion." \
  --outfile output.wav

The TTS generation script (inference_hf.py) is included in the repository (see files) to make it easier to generate Thorsten-Voice speech with Orpheus TTS.
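
To synthesize several sentences in one go, the command above can be wrapped in a small Python helper. This is a sketch, not part of the repository: it assumes inference_hf.py is in the current working directory and accepts exactly the three flags shown above (--model_path, --text, --outfile). The function names build_command and synthesize_all, the out directory, and the output file naming scheme are hypothetical.

```python
import subprocess
import sys
from pathlib import Path

MODEL_ID = "Thorsten-Voice/tv-orpheus-v2"


def build_command(text, outfile, script="inference_hf.py", model_path=MODEL_ID):
    """Build the argument list for one inference_hf.py call.

    Only the flags shown in the usage example are used.
    """
    return [
        sys.executable, script,
        "--model_path", model_path,
        "--text", text,
        "--outfile", str(outfile),
    ]


def synthesize_all(sentences, out_dir="out"):
    """Run the script once per sentence, writing numbered WAV files.

    The out_dir name and file naming scheme are illustrative choices.
    """
    Path(out_dir).mkdir(parents=True, exist_ok=True)
    for index, text in enumerate(sentences, start=1):
        outfile = Path(out_dir) / f"sentence_{index:03d}.wav"
        # check=True raises CalledProcessError if the script exits non-zero.
        subprocess.run(build_command(text, outfile), check=True)
```

Calling synthesize_all(["Guten Morgen.", "Wie geht es dir?"]) would invoke the script twice. Because each call starts a new process, the model is presumably reloaded every time, so this approach only suits small batches.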

Notes on Voice Quality

Compared to v1, this model:

  • sounds closer to the real speaker
  • uses a more natural rhythm
  • shows improved intonation in longer sentences

Minor artifacts, such as occasional unexpected pauses, may still occur and are subject to future refinement.

Acknowledgements

Special thanks to Orpheus TTS and its authors for providing an open and high-quality German TTS foundation.

Thorsten-Voice Project

Thorsten-Voice is an open voice project aimed at providing freely usable German speech data and models.
