Spaces:
Running
Running
metadata
title: Vanta
emoji: π
colorFrom: gray
colorTo: gray
sdk: docker
app_port: 7860
pinned: false
license: mit
Vanta β Target Speaker Extraction (backend)
FastAPI inference server for Vanta, a target speaker extraction (TSE) model. Upload a short reference clip of a target speaker and a noisy recording; get back the isolated voice of the target.
Endpoints
GET /healthβ returns{status, checkpoint, device}POST /extractβmultipart/form-datawith fieldsmixtureandenrollment. Pass?include_residue=trueto also get the residue (what was removed) as JSON.
Frontend lives separately (Vercel / Next.js) and talks to this API.