Thorsten-Voice – Orpheus TTS v2 (Mini Fine-Tuned)

Overview

Thorsten-Voice/tv-orpheus-v2 is an improved version of tv-orpheus-v1, further optimized to better match the natural speaking style of the original speaker.

It was fine-tuned on a carefully curated mini dataset (60 recordings, TV-24kHz-2025.12-Neutral-FT-Mini) recorded in everyday speech situations, focusing on:

calm explanations
natural pauses
neutral statements
questions
short and long sentences

Training date / codebase reference: December 2025

Training Data

Base Training (v1)

~12,000 recordings
Thorsten-Voice Dataset 2022.10
Sample rate: 24 kHz
License: CC0

Additional Fine-Tuning (v2)

Dataset: Thorsten-Voice/TV-24kHz-2025.12-Neutral-FT-Mini
Number of recordings: 60
Sample rate: 24 kHz
Content: neutral, natural everyday speech
License: CC0 (Public Domain)

The mini dataset was designed to pull the synthesized voice closer to the real speaker, improving authenticity and reducing synthetic artifacts.
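Both datasets above are listed at 24 kHz, so it can be useful to confirm that a local recording matches that rate before mixing it with this data. The following is a minimal sketch, assuming the recordings are plain PCM WAV files that Python's standard `wave` module can read. The helper names and file paths are hypothetical and are not part of the Thorsten-Voice tooling.

```python
import wave

EXPECTED_RATE_HZ = 24_000  # sample rate stated for both datasets in this card


def wav_sample_rate(path: str) -> int:
    """Return the sample rate (Hz) stored in the header of a PCM WAV file."""
    with wave.open(path, "rb") as wav_file:
        return wav_file.getframerate()


def matches_expected_rate(path: str, expected_hz: int = EXPECTED_RATE_HZ) -> bool:
    """True if the file's sample rate equals the expected rate."""
    return wav_sample_rate(path) == expected_hz


def write_silence(path: str, rate_hz: int, seconds: float = 0.1) -> None:
    """Write a short mono 16-bit silent WAV; only used to demo the check."""
    with wave.open(path, "wb") as wav_file:
        wav_file.setnchannels(1)   # mono
        wav_file.setsampwidth(2)   # 16-bit samples
        wav_file.setframerate(rate_hz)
        wav_file.writeframes(b"\x00\x00" * int(rate_hz * seconds))
```

For example, calling `matches_expected_rate("my_recording.wav")` on a hypothetical local file reports whether it would need resampling first.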

Model Lineage

Base model: canopylabs/3b-de-ft-research_release
Intermediate model: Thorsten-Voice/tv-orpheus-v1
Final fine-tuning: 60-sample neutral mini dataset
TTS framework: Orpheus TTS

License Clarification

Model weights: Licensed under Apache-2.0, inherited from the Orpheus base model.
Training data & voice: Released under CC0 (Public Domain) by the speaker.

No additional usage restrictions are imposed.

Usage Example

python inference_hf.py \
  --model_path Thorsten-Voice/tv-orpheus-v2 \
  --text "Für mich sind alle Menschen gleich, unabhängig von Herkunft oder Religion." \
  --outfile output.wav

The generation script (inference_hf.py) is included in the repository (see the file list) to make it easier to synthesize speech with Thorsten-Voice on Orpheus TTS.
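To synthesize several sentences in one go, the command above can be wrapped in a small script. This is only a sketch: it assumes inference_hf.py sits in the current directory and accepts exactly the three flags shown in the usage example (`--model_path`, `--text`, `--outfile`). The function names and the output file naming scheme are hypothetical.

```python
import shlex
import subprocess
import sys

MODEL_ID = "Thorsten-Voice/tv-orpheus-v2"


def build_command(text: str, outfile: str, script: str = "inference_hf.py") -> list[str]:
    """Build the argument list for one call, using only the flags from the usage example."""
    return [
        sys.executable, script,
        "--model_path", MODEL_ID,
        "--text", text,
        "--outfile", outfile,
    ]


def synthesize_all(sentences, prefix: str = "output", dry_run: bool = True):
    """Run (or, with dry_run=True, only print) one command per sentence."""
    commands = []
    for index, sentence in enumerate(sentences, start=1):
        command = build_command(sentence, f"{prefix}_{index:03d}.wav")
        commands.append(command)
        if dry_run:
            print(shlex.join(command))           # show the shell-quoted command
        else:
            subprocess.run(command, check=True)  # raises if the script fails
    return commands


if __name__ == "__main__":
    synthesize_all([
        "Für mich sind alle Menschen gleich, unabhängig von Herkunft oder Religion.",
        "Guten Morgen, wie geht es dir?",
    ])
```

Passing the arguments as a list avoids shell quoting problems with umlauts, commas, or quotation marks in the text. Set `dry_run=False` to actually generate audio.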

Notes on Voice Quality

Compared to v1, this model:

sounds closer to the real speaker
uses a more natural rhythm
shows improved intonation in longer sentences

Minor artifacts, such as occasional unexpected pauses, may still occur and are targets for future refinement.

Acknowledgements

Special thanks to Orpheus TTS and its authors for providing an open and high-quality German TTS foundation.

Orpheus TTS GitHub: https://github.com/canopylabs/orpheus-tts
Base model by: Canopy Labs

Thorsten-Voice Project

Thorsten-Voice is an open voice project aimed at providing freely usable German speech data and models.

Project page: https://www.Thorsten-Voice.de

Model Details

Format: Safetensors
Model size: 3B params
Tensor type: BF16
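
The listed model size and tensor type allow a rough estimate of how much memory the weights alone occupy. The sketch below assumes exactly 3 billion parameters (the "3B" figure is rounded) and 2 bytes per BF16 value. It ignores activations, the key-value cache, and any separate audio decoder, so actual requirements will be higher.

```python
# Bytes per parameter for common tensor types.
BYTES_PER_PARAM = {"BF16": 2, "FP16": 2, "FP32": 4}


def weight_size_bytes(num_params: int, tensor_type: str = "BF16") -> int:
    """Approximate storage needed for the weights alone."""
    return num_params * BYTES_PER_PARAM[tensor_type]


def to_gib(num_bytes: int) -> float:
    """Convert bytes to gibibytes (2**30 bytes)."""
    return num_bytes / 2**30


if __name__ == "__main__":
    size = weight_size_bytes(3_000_000_000, "BF16")  # assumed parameter count
    print(f"{size / 1e9:.1f} GB ({to_gib(size):.2f} GiB)")
```

Under these assumptions the weights come to about 6 GB, which gives a lower bound when choosing a GPU or deciding whether a quantized variant is needed.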