Spaces:
Running
Running
metadata
title: SOP Audio Analyzer
emoji: ποΈ
colorFrom: blue
colorTo: red
sdk: docker
pinned: false
SOP Audio Analyzer
Test Integrity Analysis - Voice fraud detection for take-at-home tests.
Features
- π€ Record or upload audio files
- π£οΈ Speaker diarization - detect multiple voices
- π― Voiceprint extraction - unique ID per speaker
- π Background analysis - detect whispers, distant voices
- π€ Synthetic detection - identify TTS/AI voices
- π’ Wake word detection - Alexa, Siri, Google
- ποΈ Cross-test tracking - find same voice across tests
Installation
# Create virtual environment
python -m venv venv
source venv/bin/activate # Linux/Mac
# or: venv\Scripts\activate # Windows
# Install dependencies
pip install -r requirements.txt
Run
streamlit run app.py
Project Structure
sop-audio-analyzer/
βββ app.py # Main Streamlit app
βββ requirements.txt
βββ src/
β βββ phase1_foundation/ # VAD, Diarization, Voiceprint
β βββ phase2_background/ # Background analysis
β βββ phase6_synthetic/ # Synthetic & wake word detection
β βββ database/ # SQLite models & queries
β βββ ui/ # UI components
βββ data/
β βββ db/ # SQLite database
β βββ clips/ # Extracted audio clips
βββ tests/
βββ audio/ # Test audio files
Usage
- Analyzer tab: Upload or record audio β Analyze β View results
- Database tab: Browse all voiceprints β Track across tests
Tech Stack
- SpeechBrain: VAD, diarization, speaker recognition
- Whisper: Transcription, wake word detection
- Streamlit: Web UI
- SQLite: Voiceprint database