TruthShieldAIVoiceGen / MODEL_CARD.md
prabindersinghh's picture
Upload 2 files
634a97b verified
---
license: apache-2.0
language:
- en
- hi
- bn
- te
- ta
- kn
- mr
- gu
tags:
- text-to-speech
- tts
- voice-cloning
- multilingual
library_name: coqui-tts
---
# TruthShield VoiceGen
## Model Description
TruthShield VoiceGen is a multi-speaker, multilingual text-to-speech model with accent and style transfer capabilities. Built for the Voice Tech For All Challenge, it supports 11 Indian and English languages with forensic speaker verification.
## Supported Languages
- Bhojpuri, Bengali, English, Gujarati, Hindi
- Chhattisgarhi, Kannada, Magahi, Maithili, Marathi, Telugu
## Model Architecture
- **Core**: VITS (Variational Inference TTS)
- **Speaker Encoder**: ECAPA-TDNN
- **Vocoder**: HiFiGAN
## Intended Use
- Accessibility applications
- Educational content
- Regional language content creation
- Voice assistants
## Limitations
- Requires speaker reference audio (WAV format)
- English text must be lowercase
- Maximum text length: 5000 characters
## Ethical Considerations
- Built-in safety verification prevents unauthorized cloning
- All generated audio includes forensic watermarking
- Consent required for voice cloning
## Training Data
- SYSPIN Indian Languages Dataset
- SpiCor Indian English Accents
- See datasets.csv for supplementary data
## Citation
@misc{truthshield2024voicegen,
title={TruthShield VoiceGen: Multi-Speaker Multilingual TTS},
author={TruthShield Team},
year={2024},
publisher={HuggingFace}
}