Spaces:

MahmoudElsamadony
/

TTV

Build error

TTV / README.md

ADD app APIs settled

e3ff092 4 months ago

2.83 kB

A newer version of the Streamlit SDK is available: 1.57.0

title: Arabic TTS (MMS)
emoji: 🗣️
colorFrom: indigo
colorTo: green
sdk: streamlit
sdk_version: 1.38.0
app_file: app.py
pinned: false

Arabic TTS Streamlit App

Arabic text-to-speech demo powered by open-source Hugging Face checkpoints:

Local inference for MMS, community VITS, SpeechT5, and XTTS engines (GPU-aware via PyTorch)
Sidebar model download button for caching checkpoints under models_cache/
Downloadable app.log for easy debugging/support
Hugging Face token input if you later add hosted models requiring authentication

sudo apt-get update && sudo apt-get install -y python3-venv espeak-ng
python3 -m venv .venv
source .venv/bin/activate
pip install -r requirements.txt

Example (CUDA 12.1):

pip install --pre torch torchvision torchaudio --index-url https://download.pytorch.org/whl/nightly/cu121

CPU-only fallback:

pip install torch --index-url https://download.pytorch.org/whl/cpu

streamlit run app.py

Open the printed URL (usually http://localhost:8501).

Pick an Arabic model in the sidebar.
Expand Model assets to download weights locally (recommended for MMS/VITS/SpeechT5).
Enter Arabic text and click 🔊 Generate Speech.
Download app.log from the sidebar if you need to inspect or share errors.

StopIteration from hosted inference: current models run locally only.
Message "النص المدخل لم ينتج أي رموز…": the tokenizer stripped everything. Add Arabic letters or a longer phrase.
SpeechT5 downloads HiFi-GAN + CMU x-vectors the first time; keep the tab open.
SpeechT5 also requires sentencepiece (already listed in requirements.txt). If you see an import error, re-run pip install sentencepiece and restart the app.
For AhmedEladl/saudi-tts (XTTS-style), provide local paths in the sidebar: config.json, vocab.json, the checkpoint directory containing the .pth file, and a speaker.wav. If you don't have the Coqui TTS library installed, run pip install TTS.
CUDA OOM: shorten the text input or set CUDA_VISIBLE_DEVICES="" to force CPU inference.