|
|
--- |
|
|
title: Carsa AI - African Translation API |
|
|
emoji: π |
|
|
colorFrom: blue |
|
|
colorTo: green |
|
|
sdk: docker |
|
|
app_port: 7860 |
|
|
pinned: false |
|
|
license: mit |
|
|
short_description: African language translation API with TTS/STT |
|
|
tags: |
|
|
- translation |
|
|
- speech-recognition |
|
|
- text-to-speech |
|
|
- african-languages |
|
|
- fastapi |
|
|
- whisper |
|
|
--- |
|
|
|
|
|
# π Carsa AI - African Language Translation API |
|
|
|
|
|
[](https://huggingface.co/spaces/CarsaAI/carsa_api) |
|
|
[](https://fastapi.tiangolo.com) |
|
|
[](https://python.org) |
|
|
|
|
|
A comprehensive AI-powered API for African language translation, speech-to-text, and text-to-speech capabilities. |
|
|
|
|
|
## π― Features |
|
|
|
|
|
- **π Translation**: English β 4 African languages (Twi, Ga, Ewe, Hausa) |
|
|
- **π€ Speech-to-Text**: Advanced ASR using Whisper |
|
|
- **π Text-to-Speech**: High-quality voice synthesis with 109+ voices |
|
|
- **β‘ Fast API**: Optimized for mobile applications |
|
|
- **π Global Access**: Hosted on Hugging Face Spaces |
|
|
|
|
|
## π± Supported Languages |
|
|
|
|
|
| Language | Code | Region | |
|
|
|----------|------|--------| |
|
|
| Twi | `twi` | Ghana | |
|
|
| Ga | `ga` | Ghana | |
|
|
| Ewe | `ewe` | Ghana/Togo | |
|
|
| Hausa | `hausa` | Nigeria/Niger | |
|
|
|
|
|
## π API Endpoints |
|
|
|
|
|
### Translation |
|
|
```http |
|
|
POST /translate |
|
|
Content-Type: application/json |
|
|
|
|
|
{ |
|
|
"text": "Hello, how are you?", |
|
|
"target_lang": "twi" |
|
|
} |
|
|
``` |
|
|
|
|
|
### Speech-to-Text |
|
|
```http |
|
|
POST /speech-to-text |
|
|
Content-Type: multipart/form-data |
|
|
|
|
|
audio_file: [WAV file] |
|
|
``` |
|
|
|
|
|
### Text-to-Speech |
|
|
```http |
|
|
POST /text-to-speech |
|
|
Content-Type: application/json |
|
|
|
|
|
{ |
|
|
"text": "Hello world" |
|
|
} |
|
|
``` |
|
|
|
|
|
## π₯ Health Check |
|
|
```http |
|
|
GET /health |
|
|
``` |
|
|
|
|
|
## π Interactive Documentation |
|
|
|
|
|
Visit `/docs` for complete interactive API documentation. |
|
|
|
|
|
## π οΈ Technology Stack |
|
|
|
|
|
- **FastAPI**: Modern Python web framework |
|
|
- **Transformers**: Hugging Face model library |
|
|
- **Whisper**: OpenAI speech recognition |
|
|
- **Coqui TTS**: Text-to-speech synthesis |
|
|
- **PyTorch**: Deep learning framework |
|
|
|
|
|
## π Performance |
|
|
|
|
|
- **Translation**: ~200ms response time |
|
|
- **Speech-to-Text**: ~2-5 seconds processing |
|
|
- **Text-to-Speech**: ~1-3 seconds generation |
|
|
- **Uptime**: 99.9%+ on Hugging Face Spaces |
|
|
|
|
|
## π± Mobile Integration |
|
|
|
|
|
Perfect for Flutter, React Native, or any mobile framework. Optimized for: |
|
|
- Low latency responses |
|
|
- Efficient data usage |
|
|
- Reliable connectivity |
|
|
- Global accessibility |
|
|
|
|
|
--- |
|
|
|
|
|
**Built with β€οΈ for African language preservation and accessibility** |
|
|
|
|
|
<!-- Trigger rebuild --> |