File size: 740 Bytes
c4576b6
f145097
 
 
 
c4576b6
 
f145097
c4576b6
 
f145097
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
---
title: STT Audio Caption Generator
emoji: 🎡
colorFrom: blue
colorTo: purple
sdk: docker
pinned: false
license: mit
---

# Audio Caption Generator

A Python-based audio transcription service with a neobrutalist web interface.

## Features
- 🎡 Audio file upload via REST API
- πŸ€– Automatic STT processing using faster-whisper
- πŸ’Ύ SQLite database for queue management
- 🎨 Neobrutalist UI with smooth animations
- πŸ”„ Real-time status updates

## Usage
Access the web interface at the Space URL above.

## API Endpoints
- POST `/api/upload` - Upload audio file
- GET `/api/files` - Get all files
- GET `/api/files/<id>` - Get specific file

## Supported Formats
WAV, MP3, FLAC, OGG, M4A, AAC

---
*Auto-deployed from GitHub*