TruthShieldAIVoiceGen / MODEL_CARD.md
prabindersinghh's picture
Upload 2 files
634a97b verified
metadata
license: apache-2.0
language:
  - en
  - hi
  - bn
  - te
  - ta
  - kn
  - mr
  - gu
tags:
  - text-to-speech
  - tts
  - voice-cloning
  - multilingual
library_name: coqui-tts

TruthShield VoiceGen

Model Description

TruthShield VoiceGen is a multi-speaker, multilingual text-to-speech model with accent and style transfer capabilities. Built for the Voice Tech For All Challenge, it supports 11 Indian and English languages with forensic speaker verification.

Supported Languages

  • Bhojpuri, Bengali, English, Gujarati, Hindi
  • Chhattisgarhi, Kannada, Magahi, Maithili, Marathi, Telugu

Model Architecture

  • Core: VITS (Variational Inference TTS)
  • Speaker Encoder: ECAPA-TDNN
  • Vocoder: HiFiGAN

Intended Use

  • Accessibility applications
  • Educational content
  • Regional language content creation
  • Voice assistants

Limitations

  • Requires speaker reference audio (WAV format)
  • English text must be lowercase
  • Maximum text length: 5000 characters

Ethical Considerations

  • Built-in safety verification prevents unauthorized cloning
  • All generated audio includes forensic watermarking
  • Consent required for voice cloning

Training Data

  • SYSPIN Indian Languages Dataset
  • SpiCor Indian English Accents
  • See datasets.csv for supplementary data

Citation

@misc{truthshield2024voicegen, title={TruthShield VoiceGen: Multi-Speaker Multilingual TTS}, author={TruthShield Team}, year={2024}, publisher={HuggingFace} }