Leveraging Whisper Embeddings for Audio-based Lyrics Matching

Paper: arXiv:2510.08176

WEALY model: wealy-sbert-lyc
This is a WEALY (WEakly-supervised Audio-LYrics) model for music version identification.
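Music version identification of this kind is typically framed as retrieval: each track is mapped to an embedding, and versions of a query are found by ranking candidates by similarity. As a minimal sketch of that ranking step (a hypothetical helper for illustration, not code from this repository, which may use a different similarity or aggregation):

```python
import numpy as np

def rank_versions(query_emb: np.ndarray, candidate_embs: np.ndarray) -> np.ndarray:
    """Rank candidate tracks by cosine similarity to a query embedding.

    query_emb: (d,) embedding of the query track.
    candidate_embs: (n, d) embeddings of the candidate tracks.
    Returns candidate indices sorted from most to least similar.
    """
    q = query_emb / np.linalg.norm(query_emb)
    c = candidate_embs / np.linalg.norm(candidate_embs, axis=1, keepdims=True)
    sims = c @ q  # cosine similarity of each candidate with the query
    return np.argsort(-sims)
```

The top-ranked candidates are then the presumed versions of the query track.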
```shell
# Download and run inference
python scripts/inference.py \
    model_name=audio-based-lyrics-matching/wealy-sbert-lyc \
    hidden_states=/path/to/your/hidden-states \
    partition=test \
    use_overlapping_chunks=true \
    ngpus=1
```
When running inference, you must provide `hidden_states`; the remaining options have defaults:

- `hidden_states`: Path to the directory of pre-extracted hidden states
- `partition`: Dataset partition to evaluate (default: "test")
- `use_overlapping_chunks`: Enable overlapping chunk evaluation (default: false)
- `chunk_size`: Size of overlapping chunks (default: 1500)
- `overlap_percentage`: Overlap between chunks (default: 0.9)
- `ngpus`: Number of GPUs to use (default: 1)

This model was trained for version identification using the WEALY architecture.
If you use this model, please cite:
```bibtex
@article{mancini2025wealy,
  title={Leveraging Whisper Embeddings for Audio-based Lyrics Matching},
  author={Mancini, Eleonora and Serr{\`a}, Joan and Torroni, Paolo and Mitsufuji, Yuki},
  journal={arXiv preprint arXiv:2510.08176},
  year={2025},
  url={https://github.com/helemanc/audio-based-lyrics-matching}
}
```