metadata
license: mit
tags:
  - audio
  - source-separation
  - drums
  - onnx
pipeline_tag: audio-to-audio

DrumSep ONNX

Drum element separation model converted to ONNX format for use with Splitzo.

Model Details

  • Original Author: inagoy/drumsep
  • License: MIT
  • Format: ONNX
  • Size: ~335 MB

Stems

Separates drum tracks into 4 elements:

  1. Kick (bombo)
  2. Snare (redoblante)
  3. Cymbals (platillos)
  4. Toms

Technical Specs

| Parameter       | Value                          |
|-----------------|--------------------------------|
| Sample Rate     | 44100 Hz                       |
| Segment Length  | 40 seconds (1,764,000 samples) |
| STFT n_fft      | 4096                           |
| STFT hop_length | 1024                           |
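
The figures above fix a few derived quantities that are useful when preparing buffers. The sketch below only does the arithmetic. The frame counts depend on whether the STFT pads the signal by `n_fft // 2` on each side ("centered") or not, and this card does not say which convention the conversion uses, so both are shown as assumptions.

```python
# Values taken from the Technical Specs above.
SAMPLE_RATE = 44100
SEGMENT_SECONDS = 40
N_FFT = 4096
HOP_LENGTH = 1024

# Samples per channel in one model segment.
segment_samples = SAMPLE_RATE * SEGMENT_SECONDS

# A one-sided STFT of a real signal has n_fft // 2 + 1 frequency bins.
freq_bins = N_FFT // 2 + 1

# Frame counts under two common conventions (assumption: which one
# applies here is not documented in this card).
frames_centered = 1 + segment_samples // HOP_LENGTH
frames_uncentered = 1 + (segment_samples - N_FFT) // HOP_LENGTH

# Resolution of the transform.
bin_width_hz = SAMPLE_RATE / N_FFT
hop_ms = 1000 * HOP_LENGTH / SAMPLE_RATE

print(f"segment samples:   {segment_samples}")    # 1764000
print(f"frequency bins:    {freq_bins}")          # 2049
print(f"frames (centered): {frames_centered}")    # 1723
print(f"frames (no pad):   {frames_uncentered}")  # 1719
print(f"bin width: {bin_width_hz:.2f} Hz, hop: {hop_ms:.2f} ms")
```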

Usage

This model is automatically downloaded by the Splitzo app on first use.

  • Input: Pre-separated drum stem (stereo, 44.1 kHz)
  • Output: 4 separated drum elements
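
Because the model works on fixed 40-second segments, a drum stem of any other length has to be cut into segments and padded before inference. How Splitzo does this is not documented here. The sketch below assumes plain non-overlapping windows with zero padding on the last one; `plan_segments` is a hypothetical helper, not part of Splitzo or DrumSep.

```python
SAMPLE_RATE = 44100
SEGMENT_SAMPLES = 40 * SAMPLE_RATE  # 1,764,000 samples per channel


def plan_segments(total_samples, segment=SEGMENT_SAMPLES):
    """Return (start, stop, pad) triples that cover `total_samples`.

    `start` and `stop` index into the input; `pad` is the number of
    zero samples to append so the segment reaches the model's length.
    Assumption: non-overlapping windows. A real pipeline may overlap
    segments and cross-fade to avoid boundary artifacts.
    """
    plan = []
    for start in range(0, max(total_samples, 0), segment):
        stop = min(start + segment, total_samples)
        plan.append((start, stop, segment - (stop - start)))
    return plan


# A 60-second stem needs two segments; the second is half padding.
for start, stop, pad in plan_segments(60 * SAMPLE_RATE):
    print(start, stop, pad)
```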

Attribution

This ONNX conversion is based on the original DrumSep model by @inagoy.

MIT License

Copyright (c) 2023 inagoy

Permission is hereby granted, free of charge, to any person obtaining a copy
of this software and associated documentation files (the "Software"), to deal
in the Software without restriction, including without limitation the rights
to use, copy, modify, merge, publish, distribute, sublicense, and/or sell
copies of the Software...

Conversion

Converted from the original PyTorch checkpoint using tools/convert_drumsep_onnx.py. The STFT operations were moved outside the model for ONNX compatibility, so the calling code is responsible for computing them.
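
Since the STFT is no longer part of the graph, the calling code has to compute it. The following is a minimal NumPy sketch using the `n_fft` and `hop_length` values from Technical Specs. The window (periodic Hann), the reflect padding, the lack of normalization, and the output layout are all assumptions; check tools/convert_drumsep_onnx.py for the exact convention the model expects. `stft_mono` is a hypothetical name.

```python
import numpy as np

N_FFT = 4096
HOP_LENGTH = 1024


def stft_mono(signal, n_fft=N_FFT, hop=HOP_LENGTH):
    """One-sided STFT of a 1-D signal, returned as (freq_bins, frames).

    Assumptions (not confirmed by this card): periodic Hann window,
    centered frames with reflect padding, no normalization.
    """
    signal = np.asarray(signal, dtype=np.float64)
    # Pad so that frame i is centered on sample i * hop.
    padded = np.pad(signal, n_fft // 2, mode="reflect")
    # Periodic Hann window of length n_fft.
    window = np.hanning(n_fft + 1)[:-1]
    # All length-n_fft windows, then keep every hop-th one.
    frames = np.lib.stride_tricks.sliding_window_view(padded, n_fft)[::hop]
    return np.fft.rfft(frames * window, axis=-1).T


# One second of a sine placed exactly on bin 100.
sr = 44100
t = np.arange(sr) / sr
spec = stft_mono(np.sin(2 * np.pi * (100 * sr / N_FFT) * t))
print(spec.shape)  # (2049, 44)
```

For a stereo stem, the same function would be applied to each channel separately before stacking the results into whatever tensor layout the ONNX model takes.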