
license: cc-by-nc-4.0
language:
  - de
  - en
tags:
  - text-to-speech
  - tts
  - german
  - voice-cloning
  - zero-shot
  - emotional-tts
pipeline_tag: text-to-speech

German-TTS

german-tts.de - A highly optimized German text-to-speech system with zero-shot voice cloning.

Features

German speech synthesis - optimized for native German pronunciation
Zero-shot voice cloning - clone any voice from 3-10 s of reference audio
Emotional speech (EN) - Happy, Sad, Angry, Surprise
Speed control - 0.5x to 2.0x
51 voices - 22 German + 29 English
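
The speed range and the reference-audio length listed above can be checked before a synthesis call. The following is a minimal sketch using only the Python standard library; the helper names (`check_speed`, `wav_duration_seconds`, `check_reference_duration`) are hypothetical and not part of `german_tts`, and whether the library enforces these limits itself is not stated here.

```python
import wave

# Limits taken from the feature list; treating them as inclusive is an assumption.
MIN_SPEED, MAX_SPEED = 0.5, 2.0
MIN_REF_SECONDS, MAX_REF_SECONDS = 3.0, 10.0


def check_speed(speed: float) -> float:
    """Return the speed unchanged if it lies within 0.5x to 2.0x, else raise."""
    if not MIN_SPEED <= speed <= MAX_SPEED:
        raise ValueError(f"speed {speed} is outside {MIN_SPEED}x to {MAX_SPEED}x")
    return speed


def wav_duration_seconds(path: str) -> float:
    """Duration of an uncompressed PCM WAV file, read with the stdlib wave module."""
    with wave.open(path, "rb") as wav:
        return wav.getnframes() / wav.getframerate()


def check_reference_duration(seconds: float) -> float:
    """Return the duration unchanged if it lies within 3 to 10 seconds, else raise."""
    if not MIN_REF_SECONDS <= seconds <= MAX_REF_SECONDS:
        raise ValueError(
            f"reference audio is {seconds:.1f} s; expected "
            f"{MIN_REF_SECONDS:.0f} to {MAX_REF_SECONDS:.0f} s"
        )
    return seconds
```

A caller might run `check_reference_duration(wav_duration_seconds("stimme.wav"))` before passing the file as `reference_audio`.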

Quick start

from german_tts import GermanTTS

tts = GermanTTS()

# Basic synthesis
audio = tts.synthesize("Guten Tag! Wie geht es Ihnen?")
audio.save("output.wav")

# With voice cloning from a reference recording
audio = tts.synthesize(
    "Das ist ein Test.",
    reference_audio="stimme.wav",
    speed=1.0
)
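
For more than one sentence, the two calls shown in the quick start (`synthesize` and `save`) can be wrapped in a small loop. This is a sketch only: `synthesize_lines` and the `line_001.wav` naming scheme are hypothetical, and the function relies on nothing beyond the `synthesize(text, ...)` and `audio.save(path)` calls that appear above.

```python
from pathlib import Path
from typing import Iterable, List


def synthesize_lines(tts, lines: Iterable[str], out_dir: str, **synth_kwargs) -> List[Path]:
    """Synthesize each non-empty line to its own WAV file and return the paths.

    `tts` is any object with a `synthesize(text, **kwargs)` method whose result
    has a `save(path)` method, as in the quick start.
    """
    out = Path(out_dir)
    out.mkdir(parents=True, exist_ok=True)
    written: List[Path] = []
    for index, text in enumerate(lines, start=1):
        text = text.strip()
        if not text:
            continue  # skip blank lines but keep the numbering aligned with the input
        audio = tts.synthesize(text, **synth_kwargs)
        path = out / f"line_{index:03d}.wav"
        audio.save(str(path))
        written.append(path)
    return written
```

With the quick-start objects this would be called as `synthesize_lines(GermanTTS(), ["Guten Tag!", "Das ist ein Test."], "out", reference_audio="stimme.wav", speed=1.0)`.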

Models

File	Size	Description
`german_tts_base.safetensors`	~1.3 GB	Main German model
`german_tts_dit.pt`	~1.3 GB	DiT version
`german_tts_dit_fp16.pt`	~650 MB	FP16 (faster)
`german_tts_dit_int8.pt`	~340 MB	INT8 (fastest)
`vocab.txt`	2 KB	Vocabulary
`emotional_en/`	~2.6 GB	Emotional English models
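
The three DiT checkpoints trade file size for precision, so one way to choose between them is by download or disk budget. The sketch below picks the largest variant that fits; the function name `pick_dit_variant` is hypothetical, and the sizes are the approximate values from the table converted to megabytes, not measured figures.

```python
from typing import Optional

# Approximate file sizes from the table above, in MB (1.3 GB taken as 1300 MB).
DIT_VARIANT_SIZES_MB = {
    "german_tts_dit.pt": 1300,
    "german_tts_dit_fp16.pt": 650,
    "german_tts_dit_int8.pt": 340,
}


def pick_dit_variant(budget_mb: float) -> Optional[str]:
    """Return the largest DiT checkpoint whose file size fits the budget.

    Returns None when even the INT8 file is too large.
    """
    fitting = {
        name: size for name, size in DIT_VARIANT_SIZES_MB.items() if size <= budget_mb
    }
    if not fitting:
        return None
    return max(fitting, key=fitting.get)
```

Preferring the largest file that fits assumes that the higher-precision checkpoint is the better default when space allows; a latency-driven choice would go the other way and start from the INT8 file.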

Text normalization

Input	Output
`10€`	zehn Euro
`14:30 Uhr`	vierzehn Uhr dreißig
`Dr. Müller`	Doktor Müller
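
The three rows above can be reproduced with a few regular expressions. This is an illustrative sketch, not the normalizer shipped with the model: the names `spell_number` and `normalize` are hypothetical, numbers are only handled from 0 to 99, and `Dr.` is the only abbreviation covered.

```python
import re

UNITS = ["null", "eins", "zwei", "drei", "vier", "fünf", "sechs", "sieben",
         "acht", "neun", "zehn", "elf", "zwölf", "dreizehn", "vierzehn",
         "fünfzehn", "sechzehn", "siebzehn", "achtzehn", "neunzehn"]
TENS = {2: "zwanzig", 3: "dreißig", 4: "vierzig", 5: "fünfzig",
        6: "sechzig", 7: "siebzig", 8: "achtzig", 9: "neunzig"}
ABBREVIATIONS = {"Dr.": "Doktor"}


def spell_number(n: int) -> str:
    """Spell an integer from 0 to 99 in German."""
    if not 0 <= n <= 99:
        raise ValueError("only 0 to 99 are supported in this sketch")
    if n < 20:
        return UNITS[n]
    tens, unit = divmod(n, 10)
    if unit == 0:
        return TENS[tens]
    # German puts the unit first and shortens "eins" to "ein": einundzwanzig.
    unit_word = "ein" if unit == 1 else UNITS[unit]
    return f"{unit_word}und{TENS[tens]}"


def _before_noun(n: int) -> str:
    # "eins" becomes "ein" directly before a noun: "ein Euro", "ein Uhr".
    return "ein" if n == 1 else spell_number(n)


def _spell_time(match: "re.Match[str]") -> str:
    hour, minute = int(match.group(1)), int(match.group(2))
    spoken = f"{_before_noun(hour)} Uhr"
    if minute:
        spoken += f" {spell_number(minute)}"
    return spoken


def normalize(text: str) -> str:
    """Expand clock times, euro amounts, and known abbreviations."""
    text = re.sub(r"\b(\d{1,2}):(\d{2}) Uhr\b", _spell_time, text)
    text = re.sub(r"\b(\d{1,2})\s?€",
                  lambda m: f"{_before_noun(int(m.group(1)))} Euro", text)
    for abbreviation, expansion in ABBREVIATIONS.items():
        text = text.replace(abbreviation, expansion)
    return text
```

Amounts with three or more digits are left untouched by the euro pattern rather than expanded incorrectly.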

License

CC-BY-NC-4.0 (non-commercial)

Commercial license: info@keyvan.ai

Developed by Keyvan.ai