JYP2024
/

Wedefense_ASV2025_WavLM_Base_Pruning

Audio Classification

Model card Files Files and versions

Wedefense_ASV2025_WavLM_Base_Pruning / README.md

JYP2024's picture

Update README.md

6993ddd verified 6 months ago

|

history blame contribute delete

1.26 kB

	---
	language:
	- en
	base_model:
	- microsoft/wavlm-base
	pipeline_tag: audio-classification
	datasets:
	- jungjee/asvspoof5
	tags:
	- anti-spoofing
	- asvspoof5
	---
	# 🔎 Hybrid Pruning for Anti-Spoofing Results

	- Input Feature: Raw waveform (via SSL model)
	- Frame Configuration: 150 frames per segment, 20 ms frame shift
	- Training Strategy: Jointly optimizing for task performance and model sparsity in a single stage. A warm-up schedule is used where the sparsity target linearly increases from 0 to the final value over the first 5 epochs.
	- Evaluation Metrics: minDCF, EER (%)
	- Evaluation Sets: Dev / Eval
	- Back-end: Multi-Head Factorized Attentive Pooling (MHFA)

	---

	# Results on ASVspoof 5

	The following table compares the performance of our proposed Hybrid Pruning (HP) single system against other top-performing systems from the official ASVspoof 5 Challenge leaderboard.

	\| System \| Dev minDCF \| Dev EER (%) \| Eval minDCF \| Eval EER (%) \|
	\| :--- \| :--- \| :--- \| :--- \| :--- \|
	\| Rank 3 (ID:T27, Fusion) \| - \| - \| 0.0937 \| 3.42 \|
	\| HP (ours, Single system) \| 0.0395 \| 1.547 \| 0.1028 \| 3.758 \|
	\| Rank 4 (ID:T23, Fusion) \| - \| - \| 0.1124 \| 4.16 \|
	\| Rank 9 (ID:T23, Best single system) \| - \| - \| 0.1499 \| 5.56 \|