Spaces:

austinekurian
/

AIapps

Sleeping

App Files Files Community

austinekurian commited on Dec 16, 2025

Commit

a31c6d0

verified ·

1 Parent(s): ea79f5f

Update README.md

Browse files

Files changed (1) hide show

README.md +38 -41

README.md CHANGED Viewed

@@ -1,45 +1,42 @@
-# മലയാളം Text → AI Voice (Free)
-A free web app (Hugging Face Space, Gradio) that converts **Malayalam** text to speech using the **AI4Bharat VITS** model.
-## How it works
-- Loads the multi‑lingual Indian **VITS TTS** model `ai4bharat/vits_rasa_13`, which includes **Malayalam** voices and multiple **styles** (NEWS, BOOK, etc.).
-- Renders a simple Gradio UI: paste Malayalam text → click **Generate** → download audio.
-> Model reference: AI4Bharat VITS model with Malayalam support and style/speaker IDs.
-> Piper/Sherpa‑ONNX alternative for Malayalam also exists (`ml_IN-arjun`), if you prefer an ONNX path.
-## Deploy (Hugging Face Spaces)
-1. Create a new Space → **Gradio**.
-2. Upload these files: `app.py`, `requirements.txt`, `README.md`.
-3. The Space will build and start automatically.
-4. Share the public URL.
-## Usage
-- Default speaker is **MAL_F (11)**.
-- Try styles like **NEWS (10)** for crisp reading, **BOOK (3)** for long‑form, **ALEXA (0)** for neutral.
-## Local run (optional)
-```bash
-python -m venv .venv && source .venv/bin/activate
-pip install -r requirements.txt
-python app.py
-```
-## Licensing
-- App code: MIT (see below).
-- **Model license**: please review the license on the model page before commercial use.
-### MIT License (app code)
-Copyright (c) 2025
-Permission is hereby granted, free of charge, to any person obtaining a copy of this software and associated documentation files (the "Software"), to deal in the Software without restriction...
-```
-(standard MIT terms)
-```
-## New features
-- **Prosody sliders:** speaking rate (0.5–1.5) & pitch (−4…+4 semitones). Implemented via resampling (approximate).
-- **Batch paragraphs:** split on blank lines → one file per paragraph × style.
-- **MP3 alongside WAV:** via `pydub` + ffmpeg (present on Spaces). Falls back to WAV if MP3 fails.

+---
+title: Malayalam Text → AI Voice (Free)
+emoji: 🗣️
+colorFrom: indigo
+colorTo: green
+sdk: gradio
+sdk_version: 4.44.0
+app_file: app.py
+python_version: 3.10
+runtime: python
+pinned: false
+license: mit
+models:
+  - ai4bharat/vits_rasa_13
+---
+# മലയാളം Text → AI Voice (Free)
+A free web app (Hugging Face Space, Gradio) that converts **Malayalam** text to speech using the **AI4Bharat VITS** model.
+## Features
+- **Multiple voice styles** (ALEXA, NEWS, BOOK, etc.)
+- **Prosody controls**: Speaking rate & pitch (approximate via resampling)
+- **Batch paragraphs**: Split text by blank line → one file per paragraph × style
+- **WAV + MP3** output (MP3 requires `ffmpeg`)
+## Deploy / Run
+1. Ensure the files below are present in the repository:
+   - `app.py`
+   - `requirements.txt`
+   - `packages.txt` *(contains `ffmpeg` for MP3)*
+   - `LICENSE`
+2. Accept access to the gated model **ai4bharat/vits_rasa_13** on its model page (click “Access repository / Agree”).
+3. If you still get permission errors, add a read token as a Space secret:
+   - **Settings → Variables and secrets → New secret**
+   - Name: `HF_TOKEN` | Value: your Hugging Face read token
+4. Restart the Space.
+## Notes
+- Prosody controls are approximate (client-side resampling). For true SSML prosody, consider Azure AI Speech Malayalam neural voice (ml-IN-SobhanaNeural).
+``