Audio Classification
Russian
MusicDetection / README.md
NikiPshg's picture
Update README.md
93076b3 verified
metadata
license: cc-by-nc-4.0
language:
  - ru
base_model:
  - microsoft/wavlm-base-plus
pipeline_tag: audio-classification

Music Detection with WavLM

Detects if audio contains music.
EER: 2.5–3% | Based on microsoft/wavlm-base-plus the best threshold value 0.2442

Quick Start

git clone https://huggingface.co/MTUCI/MusicDetection
cd MusicDetection
pip install -r requirements.txt

Usage

from model import WavLMForMusicDetection
from safetensors import safe_open

model = WavLMForMusicDetection(batch_size=32, device='cuda')
with safe_open('music_detection.safetensors', framework="pt") as f:
    model.load_state_dict({k: f.get_tensor(k) for k in f.keys()})

probs = model.predict_proba(['audio1.mp3', 'audio2.wav'])  # → tensor([0.88, 0.11])