---
license: mit
tags:
- audio
- source-separation
- drums
- onnx
pipeline_tag: audio-to-audio
---

# DrumSep ONNX

Drum element separation model converted to ONNX format for use with Splitzo.

## Model Details

- **Original Author**: [inagoy/drumsep](https://github.com/inagoy/drumsep)
- **License**: MIT
- **Format**: ONNX
- **Size**: ~335 MB

## Stems

Separates drum tracks into 4 elements:

1. **Kick** (bombo)
2. **Snare** (redoblante)
3. **Cymbals** (platillos)
4. **Toms**

## Technical Specs

| Parameter | Value |
|-----------|-------|
| Sample Rate | 44100 Hz |
| Segment Length | 40 seconds (1,764,000 samples) |
| STFT n_fft | 4096 |
| STFT hop_length | 1024 |

## Usage

This model is automatically downloaded by the Splitzo app on first use.

**Input**: Pre-separated drum stem (stereo, 44.1 kHz)

**Output**: 4 separated drum elements

## Attribution

This ONNX conversion is based on the original DrumSep model by [@inagoy](https://github.com/inagoy).

```
MIT License

Copyright (c) 2023 inagoy

Permission is hereby granted, free of charge, to any person obtaining a copy
of this software and associated documentation files (the "Software"), to deal
in the Software without restriction, including without limitation the rights
to use, copy, modify, merge, publish, distribute, sublicense, and/or sell
copies of the Software...
```

## Conversion

Converted from PyTorch checkpoint using `tools/convert_drumsep_onnx.py` with STFT operations moved outside the model for ONNX compatibility.
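
Because the STFT runs outside the model and the segment length is fixed, a caller has to cut the audio into 40-second segments and compute the spectrogram itself. The sketch below only works through the arithmetic implied by the Technical Specs table. The helper names `plan_segments` and `centered_stft_frames` are hypothetical, and both the non-overlapping chunking and the centered-STFT frame convention are assumptions; the actual preprocessing in Splitzo or `tools/convert_drumsep_onnx.py` may differ.

```python
# Values taken from the Technical Specs table.
SAMPLE_RATE = 44_100   # Hz
SEGMENT_SECONDS = 40
N_FFT = 4096
HOP_LENGTH = 1024

SEGMENT_SAMPLES = SAMPLE_RATE * SEGMENT_SECONDS  # 1,764,000 samples
FREQ_BINS = N_FFT // 2 + 1                       # one-sided spectrum of a real signal


def plan_segments(total_samples, segment_samples=SEGMENT_SAMPLES):
    """Return (start, end, pad) tuples that cover the signal without overlap.

    `pad` is the number of zero samples needed to bring the segment up to
    the fixed length. Non-overlapping chunking is an assumption of this
    sketch, not something the model card specifies.
    """
    plan = []
    for start in range(0, max(total_samples, 0), segment_samples):
        end = min(start + segment_samples, total_samples)
        pad = segment_samples - (end - start)
        plan.append((start, end, pad))
    return plan


def centered_stft_frames(n_samples, hop_length=HOP_LENGTH):
    """Frame count for a centered STFT, i.e. 1 + n_samples // hop_length.

    Assumes the signal is padded by n_fft // 2 on both sides before framing.
    """
    return 1 + n_samples // hop_length


if __name__ == "__main__":
    three_minutes = 180 * SAMPLE_RATE
    for start, end, pad in plan_segments(three_minutes):
        print(f"samples {start}..{end}, zero-pad {pad}")
    print("frames per segment:", centered_stft_frames(SEGMENT_SAMPLES))
    print("frequency bins:", FREQ_BINS)
```

Under these assumptions, a 3-minute stem splits into four full segments plus one half-filled segment that needs zero-padding before it can be passed to the fixed-size model.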