sop-audio-analyzer / README.md
daasime's picture
Fix HF Spaces deployment: use Docker SDK, pin huggingface_hub, pre-download models
f3b6cec
metadata
title: SOP Audio Analyzer
emoji: πŸŽ™οΈ
colorFrom: blue
colorTo: red
sdk: docker
pinned: false

SOP Audio Analyzer

Test Integrity Analysis - Voice fraud detection for take-at-home tests.

Features

  • 🎀 Record or upload audio files
  • πŸ—£οΈ Speaker diarization - detect multiple voices
  • 🎯 Voiceprint extraction - unique ID per speaker
  • πŸ”ˆ Background analysis - detect whispers, distant voices
  • πŸ€– Synthetic detection - identify TTS/AI voices
  • πŸ“’ Wake word detection - Alexa, Siri, Google
  • πŸ—„οΈ Cross-test tracking - find same voice across tests

Installation

# Create virtual environment
python -m venv venv
source venv/bin/activate  # Linux/Mac
# or: venv\Scripts\activate  # Windows

# Install dependencies
pip install -r requirements.txt

Run

streamlit run app.py

Project Structure

sop-audio-analyzer/
β”œβ”€β”€ app.py                    # Main Streamlit app
β”œβ”€β”€ requirements.txt
β”œβ”€β”€ src/
β”‚   β”œβ”€β”€ phase1_foundation/    # VAD, Diarization, Voiceprint
β”‚   β”œβ”€β”€ phase2_background/    # Background analysis
β”‚   β”œβ”€β”€ phase6_synthetic/     # Synthetic & wake word detection
β”‚   β”œβ”€β”€ database/             # SQLite models & queries
β”‚   └── ui/                   # UI components
β”œβ”€β”€ data/
β”‚   β”œβ”€β”€ db/                   # SQLite database
β”‚   └── clips/                # Extracted audio clips
└── tests/
    └── audio/                # Test audio files

Usage

  1. Analyzer tab: Upload or record audio β†’ Analyze β†’ View results
  2. Database tab: Browse all voiceprints β†’ Track across tests

Tech Stack

  • SpeechBrain: VAD, diarization, speaker recognition
  • Whisper: Transcription, wake word detection
  • Streamlit: Web UI
  • SQLite: Voiceprint database