Habibi-TTS ALG Production Optimization

Production-ready optimization scripts for the Habibi-TTS Algerian Arabic (ALG) specialized model.

Research Validation Summary

Claim	Status	Evidence
EPSS / `get_epss_timesteps()`	TRUE	Built into F5-TTS v1.1.20+ since May 2025. Auto-applies when `steps ∈ {5,6,7,10,12,16}`
Sway Sampling	TRUE	Native to F5-TTS, default `sway_sampling_coef=-1.0`
F5R-TTS (29.5% WER)	UNCONFIRMED	Paper not publicly indexed. GRPO for TTS validated by DMOSpeech 2 (~10% WER improvement)
Triton/TensorRT in F5-TTS	FALSE	No off-the-shelf support. Triton runtime exists but requires manual setup
SGLang/vLLM for TTS	FALSE	Architecturally incompatible. F5-TTS is DiT+flow-matching, not autoregressive LLM
TGI maintenance mode	TRUE	Official HF docs confirm maintenance mode, recommend vLLM/SGLang for LLMs
FP8 on A10G	FALSE	A10G (Ampere/SM80) does NOT support FP8. Use BF16 + INT8 instead
Arabic diacritization	TRUE	Sadeed (Misraj/Sadeed) is SOTA for MSA. Algerian dialect needs dialect-aware preprocessing

EPSS (Empirically Pruned Step Sampling) - 4x speedup with minimal quality loss.

BF16 inference + torch.compile for A10G.

Algerian Arabic text preprocessing pipeline.

FastAPI streaming TTS server.

INT8 weight-only quantization for A10G.

Downloads last month: -; Downloads are not tracked for this model. How to track

Inference Providers NEW

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support