Jerich committed on
Commit 60c7aa2 · verified · 1 Parent(s): 0ec414d

Update README.md

Files changed (1)
  1. README.md +3 -2
README.md CHANGED
@@ -10,18 +10,19 @@ health_check_path: /health
 
  # Talklas API
 
- This FastAPI app is deployed on Hugging Face Spaces for audio transcription, translation, and text-to-speech (TTS). The model loads all at once and includes a feature for detecting inappropriate language. It includes the following endpoints:
+ This FastAPI app is deployed on Hugging Face Spaces for audio transcription, translation, and text-to-speech (TTS). The application loads all necessary models at startup and updates each model on demand, including a feature for detecting inappropriate language. It includes the following endpoints:
 
  - `/`: Returns a simple health check response.
  - `/health`: Health check endpoint for Hugging Face Spaces.
  - `/update-languages`: Updates the source and target languages for STT and TTS models.
  - `/translate-text`: Translates text and converts it to speech.
  - `/translate-audio`: Transcribes audio, translates the text, and converts the translated text to speech. Includes speech detection to handle silent audio gracefully.
+ - `/text-to-speech`: This will generate a speech from the text and target language given to the server
 
  ## Features
 
  - **Speech Detection**: The `/translate-audio` endpoint detects if the audio is silent (no speech) and returns a user-friendly response.
- - **Transcription (STT)**: Uses MMS or Whisper models to transcribe audio.
+ - **Transcription (STT)**: Uses Whisper for English and Tagalog source language and MMS for other Philippine languages to transcribe audio.
  - **Translation (MT)**: Uses the NLLB-200 model to translate text between supported languages.
  - **Text-to-Speech (TTS)**: Uses MMS-TTS models to convert translated text to speech.
 
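
The updated Transcription (STT) line describes a routing rule: Whisper for English and Tagalog sources, MMS for other Philippine languages. A minimal sketch of that rule might look like the following. It is not taken from the Talklas code base: the function name `select_stt_backend`, the plain-text language labels, and the returned backend names are all assumptions made for illustration.

```python
# Hypothetical sketch of the STT routing rule described in the README change.
# The function name, the language labels, and the backend names below are
# illustrative assumptions, not identifiers from the actual application.

# Source languages that the README says are transcribed with Whisper.
WHISPER_LANGUAGES = {"english", "tagalog"}


def select_stt_backend(source_language: str) -> str:
    """Return "whisper" for English or Tagalog, and "mms" for anything else."""
    normalized = source_language.strip().lower()
    return "whisper" if normalized in WHISPER_LANGUAGES else "mms"


if __name__ == "__main__":
    # Print the backend chosen for a few example source languages.
    for language in ("English", "Tagalog", "Cebuano"):
        print(language, "->", select_stt_backend(language))
```

Keeping the rule in one small function would let an endpoint such as `/update-languages` decide which model to load without repeating the language check, though how the real app organizes this is not shown in the diff.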