File size: 1,512 Bytes
f510fb1
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
---
license: mit
tags:
  - audio
  - source-separation
  - drums
  - onnx
pipeline_tag: audio-to-audio
---

# DrumSep ONNX

Drum element separation model converted to ONNX format for use with Splitzo.

## Model Details

- **Original Author**: [inagoy/drumsep](https://github.com/inagoy/drumsep)
- **License**: MIT
- **Format**: ONNX
- **Size**: ~335 MB

## Stems

Separates drum tracks into 4 elements:
1. **Kick** (bombo)
2. **Snare** (redoblante)
3. **Cymbals** (platillos)
4. **Toms**

## Technical Specs

| Parameter | Value |
|-----------|-------|
| Sample Rate | 44100 Hz |
| Segment Length | 40 seconds (1,764,000 samples) |
| STFT n_fft | 4096 |
| STFT hop_length | 1024 |

## Usage

This model is automatically downloaded by the Splitzo app on first use.

**Input**: Pre-separated drum stem (stereo, 44.1kHz)
**Output**: 4 separated drum elements

## Attribution

This ONNX conversion is based on the original DrumSep model by [@inagoy](https://github.com/inagoy).

```
MIT License

Copyright (c) 2023 inagoy

Permission is hereby granted, free of charge, to any person obtaining a copy
of this software and associated documentation files (the "Software"), to deal
in the Software without restriction, including without limitation the rights
to use, copy, modify, merge, publish, distribute, sublicense, and/or sell
copies of the Software...
```

## Conversion

Converted from PyTorch checkpoint using `tools/convert_drumsep_onnx.py` with STFT operations moved outside the model for ONNX compatibility.