File size: 1,483 Bytes
634a97b
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
---
license: apache-2.0
language:
  - en
  - hi
  - bn
  - te
  - ta
  - kn
  - mr
  - gu
tags:
  - text-to-speech
  - tts
  - voice-cloning
  - multilingual
library_name: coqui-tts
---

# TruthShield VoiceGen

## Model Description

TruthShield VoiceGen is a multi-speaker, multilingual text-to-speech model with accent and style transfer capabilities. Built for the Voice Tech For All Challenge, it supports 11 Indian and English languages with forensic speaker verification.

## Supported Languages

- Bhojpuri, Bengali, English, Gujarati, Hindi
- Chhattisgarhi, Kannada, Magahi, Maithili, Marathi, Telugu

## Model Architecture

- **Core**: VITS (Variational Inference TTS)
- **Speaker Encoder**: ECAPA-TDNN
- **Vocoder**: HiFiGAN

## Intended Use

- Accessibility applications
- Educational content
- Regional language content creation
- Voice assistants

## Limitations

- Requires speaker reference audio (WAV format)
- English text must be lowercase
- Maximum text length: 5000 characters

## Ethical Considerations

- Built-in safety verification prevents unauthorized cloning
- All generated audio includes forensic watermarking
- Consent required for voice cloning

## Training Data

- SYSPIN Indian Languages Dataset
- SpiCor Indian English Accents
- See datasets.csv for supplementary data

## Citation

@misc{truthshield2024voicegen,
  title={TruthShield VoiceGen: Multi-Speaker Multilingual TTS},
  author={TruthShield Team},
  year={2024},
  publisher={HuggingFace}
}