Spaces:
Sleeping
Sleeping
File size: 2,823 Bytes
e162e78 1d4f6ee | 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 | ---
title: Linearhead Leaderboard
emoji: π
colorFrom: gray
colorTo: yellow
sdk: gradio
sdk_version: 5.49.1
app_file: app.py
pinned: false
license: apache-2.0
short_description: Comprehensive comparison of Linear-Head classifiers
---
Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference
# ποΈ Linear-Head Model Leaderboard
This leaderboard presents a comprehensive comparison of **Linear-Head classifiers** trained on a variety of **Self-Supervised Learning (SSL)** speech representations from the **S3PRL** library. It highlights model performance across multiple spoofing datasets, codecs, and TTS attacks in the context of **audio deepfake detection**.
---
## Frontend β SSL Feature Extractors
The **frontend** of each model is a frozen SSL feature extractor from **S3PRL**, capable of generating rich speech embeddings.
These extractors are pre-trained on large-scale audio corpora and capture different aspects of speech acoustics and phonetic content.
The leaderboard includes models built with several SSL backbones such as:
* **WavLM-Large**
* **Wav2Vec 2.0 XLSR (xls_r_300m)**
* **NPC 960 hr**
* **HuBERT**, **APC**, and others
Each extractor converts input waveforms into frame-level representations, serving as the foundation for downstream spoof detection.
---
## Backend β Classifier Models
On top of these SSL embeddings, four **downstream classifier architectures** are implemented.
Among them, the **Linear-Head model** serves as a lightweight yet highly effective backend.
It projects the SSL features into spoof/bonafide decision scores using a single fully connected layer trained with binary classification loss.
The simplicity of this approach allows fast adaptation and fair benchmarking across different SSL frontends.
---
## What the Leaderboard Shows
The leaderboard summarizes key results from extensive evaluations.
It includes separate sections for:
* **Main Leader Board** β Overall ranking based on average EER or TNR.
* **Models Performance on Each Data** β Per-dataset or per-attack breakdowns.
* **TTS Difficulty Level Per Model** β Shows which TTS generators most effectively fool the models.
* **Performance on Codecs** β Evaluates robustness under various compression schemes.
* **Best Model per Attack** β Highlights the top-performing model for each individual attack type.
---
## Purpose
The goal of this leaderboard is to provide a transparent, unified view of **how SSL-based frontends and lightweight classifier backends perform in deepfake speech detection tasks**.
It enables researchers and engineers to identify the most robust combinations of feature extractors and classifier heads, supporting future improvements in generalization, efficiency, and security of speech authentication systems.
|