metadata
title: STT Audio Caption Generator
emoji: 🎵
colorFrom: blue
colorTo: purple
sdk: docker
pinned: false
license: mit
Audio Caption Generator
A Python-based audio transcription service with a neobrutalist web interface.
Features
- 🎵 Audio file upload via REST API
- 🤖 Automatic STT processing using faster-whisper
- 💾 SQLite database for queue management
- 🎨 Neobrutalist UI with smooth animations
- 🔄 Real-time status updates
Usage
Access the web interface at the Space URL above.
API Endpoints
- POST /api/upload - Upload audio file
- GET /api/files - Get all files
- GET /api/files/<id> - Get specific file
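A minimal client sketch for these endpoints, using only the Python standard library. Several details are assumptions, not documented behavior: the multipart form field name `file`, JSON response bodies, and the placeholder `BASE_URL`. The helper names (`build_upload_request`, `list_files`, `get_file`) are hypothetical.

```python
import json
import uuid
import urllib.request

# Assumption: placeholder address; replace with the actual Space URL.
BASE_URL = "http://localhost:7860"


def build_upload_request(base_url, filename, data, field="file"):
    """Build a multipart/form-data POST request for /api/upload.

    The form field name ("file") is an assumption; check the server code.
    """
    boundary = uuid.uuid4().hex
    head = (
        f"--{boundary}\r\n"
        f'Content-Disposition: form-data; name="{field}"; filename="{filename}"\r\n'
        "Content-Type: application/octet-stream\r\n\r\n"
    ).encode()
    tail = f"\r\n--{boundary}--\r\n".encode()
    return urllib.request.Request(
        f"{base_url}/api/upload",
        data=head + data + tail,
        headers={"Content-Type": f"multipart/form-data; boundary={boundary}"},
        method="POST",
    )


def get_json(url):
    """GET a URL and decode the body as JSON (JSON responses are assumed)."""
    with urllib.request.urlopen(url) as resp:
        return json.load(resp)


def list_files(base_url):
    """GET /api/files: all files."""
    return get_json(f"{base_url}/api/files")


def get_file(base_url, file_id):
    """GET /api/files/<id>: a single file."""
    return get_json(f"{base_url}/api/files/{file_id}")
```

To upload, read the audio bytes, pass them to `build_upload_request(BASE_URL, "clip.wav", audio_bytes)`, and send the result with `urllib.request.urlopen`. Calling `get_file` repeatedly is one way to watch a file's status, although the response fields are not documented here.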
Supported Formats
WAV, MP3, FLAC, OGG, M4A, AAC
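
A client can check the file extension against this list before uploading. The sketch below is only a client-side convenience; how the service itself validates uploads (by extension or by content) is not stated here, and the names `SUPPORTED_EXTENSIONS` and `is_supported` are hypothetical.

```python
from pathlib import Path

# Extensions taken from the supported formats listed above.
SUPPORTED_EXTENSIONS = {".wav", ".mp3", ".flac", ".ogg", ".m4a", ".aac"}


def is_supported(filename):
    """Return True if the filename's extension is in the supported list.

    The comparison is case-insensitive; file content is not inspected.
    """
    return Path(filename).suffix.lower() in SUPPORTED_EXTENSIONS
```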
Auto-deployed from GitHub