--- title: Vanta emoji: 🎙 colorFrom: gray colorTo: gray sdk: docker app_port: 7860 pinned: false license: mit --- # Vanta — Target Speaker Extraction (backend) FastAPI inference server for Vanta, a target speaker extraction (TSE) model. Upload a short reference clip of a target speaker and a noisy recording; get back the isolated voice of the target. ## Endpoints - `GET /health` — returns `{status, checkpoint, device}` - `POST /extract` — `multipart/form-data` with fields `mixture` and `enrollment`. Pass `?include_residue=true` to also get the residue (what was removed) as JSON. Frontend lives separately (Vercel / Next.js) and talks to this API.