indicF5

Sleeping

App Files Files Community

ashishkblink commited on Jan 5

Commit

72ab360

verified ·

1 Parent(s): fb90de7

Upload README.md with huggingface_hub

Browse files

Files changed (1) hide show

README.md +49 -38

README.md CHANGED Viewed

@@ -1,61 +1,72 @@
 ---
-title: Vakya TTS Playground
-emoji: 🎤
-colorFrom: purple
-colorTo: pink
 sdk: gradio
-sdk_version: 6.2.0
 app_file: app.py
 pinned: false
-license: apache-2.0
 ---
-# 🎤 Vakya TTS Playground
-**India's No. 1 TTS Model for Hindi and Other Indian Languages**
-Interactive playground to test and experience the power of Vakya TTS - a state-of-the-art Text-to-Speech model fine-tuned from XTTS-v2, specifically optimized for Hindi and other Indian languages.
-## 🎯 Features
-- **High-quality Hindi TTS** - Optimized specifically for Hindi pronunciation and intonation
-- **Multi-Indian Language Support** - Supports 10+ Indian languages
-- **Voice Cloning** - Clone voices from just 6 seconds of audio
-- **Real-time Synthesis** - Fast and efficient speech generation
-- **Natural Sounding** - Human-like voice quality
 ## 🚀 How to Use
-1. **Enter Text**: Type or paste your text in the text box
-2. **Select Language**: Choose from Hindi, English, Marathi, Telugu, Tamil, Kannada, Gujarati, Punjabi, Bengali, or Urdu
-3. **Upload Speaker Audio (Optional)**: Upload a 6+ second audio file to clone the voice
-4. **Generate**: Click "Generate Speech" and enjoy the output!
-## 📊 Supported Languages
-- Hindi (hi) - Primary focus
-- English (en)
-- Marathi (mr)
-- Telugu (te)
-- Tamil (ta)
-- Kannada (kn)
-- Gujarati (gu)
-- Punjabi (pa)
-- Bengali (bn)
-- Urdu (ur)
-## 🔗 Model Repository
-The model is available at: [ashishkblink/vakya](https://huggingface.co/ashishkblink/vakya)
-## 📄 License
-Apache 2.0
-## 👤 Author
-ashishkblink
 ---
-*Built with ❤️ for the Indian language community*

 ---
+title: Vakya 2.0 - Text-to-Speech
+emoji: 🎙️
+colorFrom: blue
+colorTo: purple
 sdk: gradio
+sdk_version: 4.0.0
 app_file: app.py
 pinned: false
+license: mit
 ---
+# 🎙️ Vakya 2.0 - Text-to-Speech Playground
+**Vakya** is a high-quality Text-to-Speech model based on the IndicF5 architecture, supporting **11 Indian languages**.
+## 🌟 Features
+- **Multi-language Support**: Assamese, Bengali, Gujarati, Hindi, Kannada, Malayalam, Marathi, Odia, Punjabi, Tamil, Telugu
+- **Voice Cloning**: Uses reference audio to clone voice characteristics
+- **High Quality**: 24kHz sample rate, 0.4B parameter model
+- **Easy to Use**: Simple interface for testing and experimentation
 ## 🚀 How to Use
+1. **Load Model**: Click the "Load Model" button (first time may take a few minutes to download)
+2. **Upload Reference Audio**: Upload a short audio clip (<15 seconds recommended) that represents the voice you want to clone
+3. **Enter Reference Text** (Optional): Type what is spoken in the reference audio. If left blank, the model will auto-transcribe it
+4. **Enter Text to Generate**: Type the text you want to synthesize in any supported language
+5. **Adjust Settings** (Optional):
+   - Speed: Control the speech rate (0.5x to 2.0x)
+   - Remove Silences: Experimental feature to remove pauses
+6. **Generate**: Click "Generate Speech" and wait for the audio output
+## 📋 Model Information
+- **Model**: Vakya 2.0
+- **Repository**: [ashishkblink/vakya2.0](https://huggingface.co/ashishkblink/vakya2.0)
+- **Based on**: [IndicF5](https://github.com/AI4Bharat/IndicF5) by AI4Bharat (IIT Madras)
+- **Model Size**: 0.4B parameters
+- **Sample Rate**: 24000 Hz
+- **Training Data**: 1417 hours of high-quality speech
+- **License**: MIT License
+## 💡 Tips for Best Results
+- Keep reference audio clips short (<15 seconds) for best results
+- Use clear, high-quality reference audio
+- Provide reference text when possible for better voice matching
+- The model works best with native speakers of the target language
+## ⚠️ Terms of Use
+- You must have explicit permission to clone voices
+- Unauthorized voice cloning is strictly prohibited
+- Any misuse of this model is the responsibility of the user
+- This model is for research and educational purposes
+## 🔗 Links
+- **Model Repository**: [ashishkblink/vakya2.0](https://huggingface.co/ashishkblink/vakya2.0)
+- **GitHub**: [ashishkblink/vakya](https://github.com/ashishkblink/vakya)
+- **IndicF5**: [AI4Bharat/IndicF5](https://github.com/AI4Bharat/IndicF5)
+## 🙏 Acknowledgments
+This model is based on **IndicF5** developed by AI4Bharat (IIT Madras).
 ---
+**Vakya** - Bringing voices to Indian languages 🎙️