File size: 1,483 Bytes

634a97b

---
license: apache-2.0
language:
  - en
  - hi
  - bn
  - te
  - ta
  - kn
  - mr
  - gu
tags:
  - text-to-speech
  - tts
  - voice-cloning
  - multilingual
library_name: coqui-tts
---

# TruthShield VoiceGen

## Model Description

TruthShield VoiceGen is a multi-speaker, multilingual text-to-speech model with accent and style transfer capabilities. Built for the Voice Tech For All Challenge, it supports 11 Indian and English languages with forensic speaker verification.

## Supported Languages

- Bhojpuri, Bengali, English, Gujarati, Hindi
- Chhattisgarhi, Kannada, Magahi, Maithili, Marathi, Telugu

## Model Architecture

- **Core**: VITS (Variational Inference TTS)
- **Speaker Encoder**: ECAPA-TDNN
- **Vocoder**: HiFiGAN

## Intended Use

- Accessibility applications
- Educational content
- Regional language content creation
- Voice assistants

## Limitations

- Requires speaker reference audio (WAV format)
- English text must be lowercase
- Maximum text length: 5000 characters

## Ethical Considerations

- Built-in safety verification prevents unauthorized cloning
- All generated audio includes forensic watermarking
- Consent required for voice cloning

## Training Data

- SYSPIN Indian Languages Dataset
- SpiCor Indian English Accents
- See datasets.csv for supplementary data

## Citation

@misc{truthshield2024voicegen,
  title={TruthShield VoiceGen: Multi-Speaker Multilingual TTS},
  author={TruthShield Team},
  year={2024},
  publisher={HuggingFace}
}