Spaces:
Sleeping
Sleeping
Update README.md
Browse files
README.md
CHANGED
|
@@ -10,18 +10,19 @@ health_check_path: /health
|
|
| 10 |
|
| 11 |
# Talklas API
|
| 12 |
|
| 13 |
-
This FastAPI app is deployed on Hugging Face Spaces for audio transcription, translation, and text-to-speech (TTS). The
|
| 14 |
|
| 15 |
- `/`: Returns a simple health check response.
|
| 16 |
- `/health`: Health check endpoint for Hugging Face Spaces.
|
| 17 |
- `/update-languages`: Updates the source and target languages for STT and TTS models.
|
| 18 |
- `/translate-text`: Translates text and converts it to speech.
|
| 19 |
- `/translate-audio`: Transcribes audio, translates the text, and converts the translated text to speech. Includes speech detection to handle silent audio gracefully.
|
|
|
|
| 20 |
|
| 21 |
## Features
|
| 22 |
|
| 23 |
- **Speech Detection**: The `/translate-audio` endpoint detects if the audio is silent (no speech) and returns a user-friendly response.
|
| 24 |
-
- **Transcription (STT)**: Uses MMS
|
| 25 |
- **Translation (MT)**: Uses the NLLB-200 model to translate text between supported languages.
|
| 26 |
- **Text-to-Speech (TTS)**: Uses MMS-TTS models to convert translated text to speech.
|
| 27 |
|
|
|
|
| 10 |
|
| 11 |
# Talklas API
|
| 12 |
|
| 13 |
+
This FastAPI app is deployed on Hugging Face Spaces for audio transcription, translation, and text-to-speech (TTS). The application loads all necessary models at startup and updates each model on demand, including a feature for detecting inappropriate language. It includes the following endpoints:
|
| 14 |
|
| 15 |
- `/`: Returns a simple health check response.
|
| 16 |
- `/health`: Health check endpoint for Hugging Face Spaces.
|
| 17 |
- `/update-languages`: Updates the source and target languages for STT and TTS models.
|
| 18 |
- `/translate-text`: Translates text and converts it to speech.
|
| 19 |
- `/translate-audio`: Transcribes audio, translates the text, and converts the translated text to speech. Includes speech detection to handle silent audio gracefully.
|
| 20 |
+
- `/text-to-speech`: This will generate a speech from the text and target language given to the server
|
| 21 |
|
| 22 |
## Features
|
| 23 |
|
| 24 |
- **Speech Detection**: The `/translate-audio` endpoint detects if the audio is silent (no speech) and returns a user-friendly response.
|
| 25 |
+
- **Transcription (STT)**: Uses Whisper for English and Tagalog source language and MMS for other Philippine languages to transcribe audio.
|
| 26 |
- **Translation (MT)**: Uses the NLLB-200 model to translate text between supported languages.
|
| 27 |
- **Text-to-Speech (TTS)**: Uses MMS-TTS models to convert translated text to speech.
|
| 28 |
|