	---
	title: Maya Immersive Tamil Manga AI
	emoji: 🗣️
	colorFrom: indigo
	colorTo: purple
	sdk: docker
	pinned: false
	---

	# 🗣️ Multilingual Tamil TTS & Comic Reader AI

	A text-to-speech pipeline optimized for low-memory hardware (8 GB RAM).

	## 🚀 Features
	- 📖 Comic Reader Mode: Render PDF comic pages as images and read speech bubbles aloud.
	- 🎭 Expressive Voices: Multiple voice tones (Seductive, Excited, Dramatic) using Edge TTS.
	- 📄 Document Support: Extracts text from PDF, DOCX, and TXT files.
	- 🧠 Hybrid Pipeline: Uses cloud APIs for TTS and translation to save local RAM.
	- ⚙️ Optimized OCR: HD quadrant scanning for accurate text detection in comics.
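
The README does not spell out how "HD quadrant scanning" works. One plausible reading is that each rendered page is split into four overlapping tiles, so that small speech-bubble text is passed to OCR at a higher effective resolution and bubbles that straddle a cut are still seen whole in at least one tile. The sketch below only computes the tile geometry; `quadrant_boxes`, `to_page_coords`, and the `overlap_px` parameter are hypothetical names, not taken from this repository.

```python
from typing import List, Tuple

# A crop box in (left, top, right, bottom) pixel order,
# the same order Pillow's Image.crop() expects.
Box = Tuple[int, int, int, int]


def quadrant_boxes(width: int, height: int, overlap_px: int = 0) -> List[Box]:
    """Split a page of the given size into four overlapping quadrants.

    Assumption: `overlap_px` is how far each quadrant extends past the
    page midlines, so text lying on a midline is not cut in half.
    """
    if width <= 0 or height <= 0:
        raise ValueError("width and height must be positive")
    mid_x, mid_y = width // 2, height // 2
    if overlap_px < 0 or overlap_px > min(mid_x, mid_y):
        raise ValueError("overlap_px must be between 0 and half the page size")
    return [
        (0, 0, mid_x + overlap_px, mid_y + overlap_px),            # top-left
        (mid_x - overlap_px, 0, width, mid_y + overlap_px),        # top-right
        (0, mid_y - overlap_px, mid_x + overlap_px, height),       # bottom-left
        (mid_x - overlap_px, mid_y - overlap_px, width, height),   # bottom-right
    ]


def to_page_coords(box: Box, x: int, y: int) -> Tuple[int, int]:
    """Map a point found inside a cropped quadrant back to page coordinates."""
    left, top, _, _ = box
    return (left + x, top + y)
```

Each box can be handed to an image library's crop call before OCR, and `to_page_coords` translates detected text positions back onto the full page so results from the four tiles can be merged.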

	## 🛠️ Installation
	1. Install Python 3.10+
	2. Clone the repository and enter it: `git clone https://github.com/Rahul-new/voice.git && cd voice`
	3. Install dependencies: `pip install -r requirements.txt`
	4. Run the app: `python app.py`

	## 🎨 Voice Styles
	- Cheerful (Maya)
	- Soft & Seductive (Maya)
	- Excited & High-Pitch (Maya)
	- Deep & Sensual (Sita)
	- Dramatic Narrator (Sita)
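
As a rough illustration of how styles like these could be expressed with Edge TTS, the sketch below maps each style name to a speaking rate and pitch offset and passes them to the `edge-tts` package. The numeric values, the `VoiceStyle` class, and the `edge_tts_kwargs` and `synthesize` helpers are assumptions made for illustration; the actual presets and the Edge voices behind "Maya" and "Sita" are not stated in this README, so the voice ID is left as a caller-supplied argument.

```python
import re
from dataclasses import dataclass

_RATE_RE = re.compile(r"^[+-]\d+%$")    # e.g. "+10%"
_PITCH_RE = re.compile(r"^[+-]\d+Hz$")  # e.g. "-8Hz"


@dataclass(frozen=True)
class VoiceStyle:
    persona: str  # "Maya" or "Sita", as listed above
    rate: str     # relative speaking rate
    pitch: str    # relative pitch offset

    def __post_init__(self) -> None:
        # Reject malformed values early, before any network call is made.
        if not _RATE_RE.match(self.rate):
            raise ValueError(f"bad rate: {self.rate!r}")
        if not _PITCH_RE.match(self.pitch):
            raise ValueError(f"bad pitch: {self.pitch!r}")


# Illustrative values only; not the project's real presets.
STYLES = {
    "Cheerful": VoiceStyle("Maya", "+10%", "+5Hz"),
    "Soft & Seductive": VoiceStyle("Maya", "-10%", "-2Hz"),
    "Excited & High-Pitch": VoiceStyle("Maya", "+20%", "+15Hz"),
    "Deep & Sensual": VoiceStyle("Sita", "-15%", "-10Hz"),
    "Dramatic Narrator": VoiceStyle("Sita", "-5%", "-8Hz"),
}


def edge_tts_kwargs(style_name: str) -> dict:
    """Return the rate/pitch keyword arguments for a named style."""
    style = STYLES[style_name]
    return {"rate": style.rate, "pitch": style.pitch}


async def synthesize(text: str, voice: str, style_name: str, out_path: str) -> None:
    """Synthesize `text` to an audio file (requires network and `pip install edge-tts`)."""
    import edge_tts  # imported lazily so the presets work without the package

    communicate = edge_tts.Communicate(text, voice, **edge_tts_kwargs(style_name))
    await communicate.save(out_path)
```

To try it, pick a voice ID from the output of `edge-tts --list-voices` and run `asyncio.run(synthesize("Hello", voice, "Cheerful", "out.mp3"))` with that ID as `voice`.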