---
title: Busy Module Audio Features
emoji: "🎤"
colorFrom: indigo
colorTo: purple
sdk: docker
app_port: 7860
pinned: false
---

# Busy Module Audio Features

## Audio Feature Extraction API

This Space extracts 17 voice features from an audio clip and its transcript, including signal-to-noise ratio (SNR), noise classification, speech rate, pitch, energy, pause analysis, and emotion features.

## API

**POST** `/extract-audio-features-base64`

```json
{
  "audio_base64": "<base64-encoded-wav>",
  "transcript": "I'm driving right now"
}
```
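
A minimal client sketch for the base64 endpoint, using only the Python standard library. The base URL is an assumption (a local container listening on the `app_port` from the Space config), the helper names are hypothetical, and the response is assumed to be JSON, since its shape is not documented here.

```python
import base64
import json
import urllib.request

# Assumption: the container is reachable locally on the configured app_port.
BASE_URL = "http://localhost:7860"


def build_payload(wav_bytes: bytes, transcript: str) -> dict:
    """Build the JSON body for /extract-audio-features-base64."""
    return {
        "audio_base64": base64.b64encode(wav_bytes).decode("ascii"),
        "transcript": transcript,
    }


def extract_features_base64(wav_bytes: bytes, transcript: str,
                            base_url: str = BASE_URL) -> dict:
    """POST the payload and decode the response (assumed to be JSON)."""
    body = json.dumps(build_payload(wav_bytes, transcript)).encode("utf-8")
    request = urllib.request.Request(
        f"{base_url}/extract-audio-features-base64",
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )
    with urllib.request.urlopen(request) as response:
        return json.load(response)
```

To call it, read a WAV file in binary mode and pass the bytes along with the transcript, for example `extract_features_base64(open("clip.wav", "rb").read(), "I'm driving right now")`.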

**POST** `/extract-audio-features` (multipart form)

- `audio`: audio file upload
- `transcript`: text transcript
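
The multipart variant can be sketched with the standard library as well. The form field names `audio` and `transcript` come from the list above; the base URL, the helper names, and the `application/octet-stream` part type are assumptions, and the response is again assumed to be JSON.

```python
import json
import urllib.request
import uuid

# Assumption: the container is reachable locally on the configured app_port.
BASE_URL = "http://localhost:7860"


def encode_multipart(audio_bytes: bytes, filename: str,
                     transcript: str) -> tuple[bytes, str]:
    """Return (body, content_type) for a two-field multipart/form-data request."""
    boundary = uuid.uuid4().hex
    head = (
        f"--{boundary}\r\n"
        f'Content-Disposition: form-data; name="transcript"\r\n\r\n'
        f"{transcript}\r\n"
        f"--{boundary}\r\n"
        f'Content-Disposition: form-data; name="audio"; filename="{filename}"\r\n'
        f"Content-Type: application/octet-stream\r\n\r\n"
    ).encode("utf-8")
    tail = f"\r\n--{boundary}--\r\n".encode("utf-8")
    return head + audio_bytes + tail, f"multipart/form-data; boundary={boundary}"


def extract_features_multipart(audio_bytes: bytes, filename: str, transcript: str,
                               base_url: str = BASE_URL) -> dict:
    """POST the form to /extract-audio-features and decode the response."""
    body, content_type = encode_multipart(audio_bytes, filename, transcript)
    request = urllib.request.Request(
        f"{base_url}/extract-audio-features",
        data=body,
        headers={"Content-Type": content_type},
        method="POST",
    )
    with urllib.request.urlopen(request) as response:
        return json.load(response)
```

If the third-party `requests` package is available, `requests.post(url, files={"audio": fh}, data={"transcript": text})` builds the same kind of form with less code.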

**GET** `/health`
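
A hedged sketch of a readiness check against `/health`. The response body is not documented here, so this only treats an HTTP 200 status as healthy; the base URL and the function name are assumptions.

```python
import urllib.error
import urllib.request


def is_healthy(base_url: str = "http://localhost:7860", timeout: float = 5.0) -> bool:
    """Return True if GET /health answers with HTTP 200, False on any failure."""
    try:
        with urllib.request.urlopen(f"{base_url}/health", timeout=timeout) as response:
            return response.status == 200
    except (urllib.error.URLError, OSError):
        # Covers refused connections, timeouts, and non-2xx responses (HTTPError).
        return False
```

A check like this is useful before sending audio, since a Docker Space may need some time to start.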

## Authentication

This Space requires access to private models. Add your Hugging Face token as a secret:

1. Go to **Settings** -> **Variables and secrets**.
2. Click **New secret**.
3. Name it `HF_TOKEN`.
4. Set the value to a Hugging Face access token with read permissions.
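
Spaces expose secrets to the running container as environment variables, so server-side code can read the token from `HF_TOKEN`. The helper below is a hypothetical sketch of that lookup, not necessarily how this Space's code does it.

```python
import os
from typing import Mapping


def get_hf_token(env: Mapping[str, str] = os.environ) -> str:
    """Return the HF_TOKEN secret, failing early with a clear message if it is missing."""
    token = env.get("HF_TOKEN", "").strip()
    if not token:
        raise RuntimeError("HF_TOKEN is not set; add it under Settings -> Variables and secrets.")
    return token
```

Failing at startup makes a missing secret obvious, rather than surfacing later as an authorization error when the private models are downloaded.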