sanoramyun8
/

speaker-embedding-endpoint

Audio Classification

speaker-embedding

speaker-verification

Model card Files Files and versions

Speaker Embedding Endpoint

Custom HuggingFace Inference Endpoint for extracting speaker embeddings using SpeechBrain's ECAPA-TDNN model.

Model

This endpoint uses speechbrain/spkrec-ecapa-voxceleb model which achieves:

0.80% EER on VoxCeleb1 test set
192-dimensional speaker embeddings

Usage

API Request

curl -X POST \
  https://your-endpoint-url.endpoints.huggingface.cloud \
  -H "Authorization: Bearer YOUR_HF_TOKEN" \
  -H "Content-Type: audio/wav" \
  --data-binary "@audio.wav"

Response

{
  "embedding": [0.123, -0.456, ...],
  "dimension": 192,
  "model": "speechbrain/spkrec-ecapa-voxceleb"
}

Speaker Verification

To verify if two audio files are from the same speaker:

Extract embeddings from both audio files
Calculate cosine similarity between embeddings
If similarity > 0.6 (threshold), same speaker

from scipy.spatial.distance import cosine

similarity = 1 - cosine(embedding1, embedding2)
is_same_speaker = similarity > 0.6

Project

Part of Deep Truth - AI Deepfake Voice Detection & Speaker Verification Service

GitHub: https://github.com/yonghwan1106/deep-truth
Demo: https://deep-truth.vercel.app

Downloads last month: -