File size: 671 Bytes
20e9481
 
32de4f6
 
 
20e9481
32de4f6
20e9481
32de4f6
20e9481
 
32de4f6
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
---
title: Vanta
emoji: πŸŽ™
colorFrom: gray
colorTo: gray
sdk: docker
app_port: 7860
pinned: false
license: mit
---

# Vanta β€” Target Speaker Extraction (backend)

FastAPI inference server for Vanta, a target speaker extraction (TSE) model.
Upload a short reference clip of a target speaker and a noisy recording; get
back the isolated voice of the target.

## Endpoints

- `GET  /health` β€” returns `{status, checkpoint, device}`
- `POST /extract` β€” `multipart/form-data` with fields `mixture` and `enrollment`.
  Pass `?include_residue=true` to also get the residue (what was removed) as JSON.

Frontend lives separately (Vercel / Next.js) and talks to this API.