metadata
title: STT Audio Caption Generator
emoji: 🎵
colorFrom: blue
colorTo: purple
sdk: docker
pinned: false
license: mit

Audio Caption Generator

A Python-based audio transcription service with a neobrutalist web interface.

Features

  • 🎵 Audio file upload via REST API
  • 🤖 Automatic STT processing using faster-whisper
  • 💾 SQLite database for queue management
  • 🎨 Neobrutalist UI with smooth animations
  • 🔄 Real-time status updates
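
To illustrate how a SQLite-backed queue and faster-whisper might fit together, here is a minimal sketch. It is not the project's actual code: the `files` table, its columns, the status values, and the function names are all assumptions made for the example. The faster-whisper import is deferred so the queue logic runs even where the library is not installed.

```python
import sqlite3
from typing import Callable, Optional

# Hypothetical schema: table name, columns, and status values are assumptions.
SCHEMA = """
CREATE TABLE IF NOT EXISTS files (
    id INTEGER PRIMARY KEY AUTOINCREMENT,
    filename TEXT NOT NULL,
    status TEXT NOT NULL DEFAULT 'pending',
    transcript TEXT
)
"""


def init_db(conn: sqlite3.Connection) -> None:
    """Create the queue table if it does not exist yet."""
    conn.execute(SCHEMA)
    conn.commit()


def enqueue(conn: sqlite3.Connection, filename: str) -> int:
    """Add a file to the queue with status 'pending' and return its row id."""
    cur = conn.execute("INSERT INTO files (filename) VALUES (?)", (filename,))
    conn.commit()
    return cur.lastrowid


def whisper_transcribe(path: str) -> str:
    """Transcribe one file with faster-whisper (imported lazily)."""
    from faster_whisper import WhisperModel

    # Model size, device, and compute type are illustrative choices.
    model = WhisperModel("base", device="cpu", compute_type="int8")
    segments, _info = model.transcribe(path)
    return " ".join(segment.text.strip() for segment in segments)


def process_next(
    conn: sqlite3.Connection, transcribe: Callable[[str], str]
) -> Optional[int]:
    """Process the oldest pending file; return its id, or None if the queue is empty."""
    row = conn.execute(
        "SELECT id, filename FROM files WHERE status = 'pending' ORDER BY id LIMIT 1"
    ).fetchone()
    if row is None:
        return None
    file_id, filename = row
    conn.execute("UPDATE files SET status = 'processing' WHERE id = ?", (file_id,))
    conn.commit()
    try:
        text = transcribe(filename)
        conn.execute(
            "UPDATE files SET status = 'completed', transcript = ? WHERE id = ?",
            (text, file_id),
        )
    except Exception:
        # Record the failure so the file is not retried forever.
        conn.execute("UPDATE files SET status = 'failed' WHERE id = ?", (file_id,))
    conn.commit()
    return file_id
```

A worker loop would call `process_next(conn, whisper_transcribe)` repeatedly. Passing the transcriber in as a parameter keeps the queue logic testable with a stub function.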

Usage

Access the web interface at the Space URL above.

API Endpoints

  • POST /api/upload - Upload audio file
  • GET /api/files - Get all files
  • GET /api/files/<id> - Get specific file
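
The endpoints above could be called from Python roughly as follows. This is a sketch using only the standard library, under several assumptions that the README does not confirm: the base URL, the multipart form field name `file`, and JSON responses. Adjust these to match the running service.

```python
import json
import urllib.request
import uuid
from pathlib import Path

# Assumption: replace with the actual Space URL.
BASE_URL = "http://localhost:7860"


def build_multipart(field: str, filename: str, data: bytes) -> tuple[bytes, str]:
    """Encode one file as a multipart/form-data body; return (body, content type)."""
    boundary = uuid.uuid4().hex
    head = (
        f"--{boundary}\r\n"
        f'Content-Disposition: form-data; name="{field}"; filename="{filename}"\r\n'
        "Content-Type: application/octet-stream\r\n\r\n"
    ).encode()
    tail = f"\r\n--{boundary}--\r\n".encode()
    return head + data + tail, f"multipart/form-data; boundary={boundary}"


def upload(path: str, base_url: str = BASE_URL):
    """POST /api/upload with the file under the (assumed) form field 'file'."""
    p = Path(path)
    body, content_type = build_multipart("file", p.name, p.read_bytes())
    req = urllib.request.Request(
        f"{base_url}/api/upload",
        data=body,
        headers={"Content-Type": content_type},
        method="POST",
    )
    with urllib.request.urlopen(req) as resp:
        return json.load(resp)  # assumes the service replies with JSON


def list_files(base_url: str = BASE_URL):
    """GET /api/files."""
    with urllib.request.urlopen(f"{base_url}/api/files") as resp:
        return json.load(resp)


def get_file(file_id, base_url: str = BASE_URL):
    """GET /api/files/<id>."""
    with urllib.request.urlopen(f"{base_url}/api/files/{file_id}") as resp:
        return json.load(resp)
```

A client that wants status updates without the web interface could call `get_file` periodically until the returned record reports that processing has finished.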

Supported Formats

WAV, MP3, FLAC, OGG, M4A, AAC
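
A client can check a file against this list before uploading. The sketch below matches on the file extension only. The helper name is hypothetical, and the service itself may validate uploads differently.

```python
from pathlib import Path

# Extensions taken from the supported formats listed above.
SUPPORTED_EXTENSIONS = {".wav", ".mp3", ".flac", ".ogg", ".m4a", ".aac"}


def is_supported(filename: str) -> bool:
    """Return True if the file extension (case-insensitive) is a supported format."""
    return Path(filename).suffix.lower() in SUPPORTED_EXTENSIONS
```

An extension check does not inspect the file contents, so a renamed file would still pass it.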


Auto-deployed from GitHub