gunashree_hackathon / README.md
anish
Upgrade ML pipeline: 160+ signal features, new predict engine, feature-aware explanations
fc46ab2
metadata
title: AI Voice Detection API
emoji: 🎤
colorFrom: blue
colorTo: purple
sdk: docker
sdk_version: 4.38.0
app_port: 7860
pinned: false
license: mit

AI Voice Detection API

Detects whether a voice sample is AI-generated or human across multiple languages.

Supported Languages

  • Tamil
  • English
  • Hindi
  • Malayalam
  • Telugu

Features

  • FastAPI-based REST API
  • Wav2Vec2 embeddings + signal feature extraction
  • Pre-trained classifier for AI/Human voice detection
  • Base64 MP3 audio input
  • API key protected endpoints

API Endpoints

Health Check

GET /health

Voice Detection

POST /api/voice-detection
Headers:
  x-api-key: <your-api-key>

Body:
{
  "language": "English",
  "audioFormat": "mp3",
  "audioBase64": "<base64-encoded-audio>"
}

Response Format

{
  "status": "success",
  "language": "English",
  "classification": "HUMAN" | "AI_GENERATED",
  "confidenceScore": 0.95,
  "explanation": "Natural prosody, breathing patterns..."
}

Environment Variables

  • API_KEY: API key for authentication (default: "hackathon-secret")

Model Architecture

  • Uses Facebook's Wav2Vec2-base for audio embeddings
  • Extracts 160+ signal features (MFCCs, pitch, spectral, chroma, tonnetz, silence, onset, etc.)
  • Ensemble classifier (XGBoost + LightGBM) for final prediction
  • Feature-aware explanations using actual acoustic measurements

Team

  • ML: Gunashree
  • Backend: Tanu
  • DevOps/QA: Pavithra