AEmotionStudio commited on
Commit
327fd8c
·
verified ·
1 Parent(s): 0e0e6e6

Vendor Spotify Basic Pitch ICASSP 2022 ONNX for MAESTRO

Browse files
Files changed (2) hide show
  1. README.md +57 -0
  2. nmp.onnx +3 -0
README.md ADDED
@@ -0,0 +1,57 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: apache-2.0
3
+ language:
4
+ - en
5
+ tags:
6
+ - music
7
+ - midi
8
+ - audio-to-midi
9
+ - polyphonic-transcription
10
+ - basic-pitch
11
+ - onnx
12
+ ---
13
+
14
+ # Basic Pitch (ONNX) Mirror
15
+
16
+ Vendored copy of Spotify's [Basic Pitch](https://github.com/spotify/basic-pitch)
17
+ ICASSP 2022 polyphonic transcription model in ONNX format, re-hosted for use
18
+ in the [MAESTRO AI Workstation](https://github.com/AEmotionStudio).
19
+
20
+ ## What this model does
21
+
22
+ **Audio → MIDI polyphonic transcription** for any pitched instrument:
23
+ guitar, bass, vocals, synth, piano. Lightweight (~230 KB) and fast.
24
+
25
+ ## Why ONNX (not the pip package)?
26
+
27
+ The official `basic-pitch` PyPI package depends on `tensorflow<2.15.1`,
28
+ which has no Python 3.14 wheels and would conflict with the MAESTRO
29
+ backend's torch installation. Spotify ships the same model as a small
30
+ ONNX export, which we serve here and run via `onnxruntime` — same model,
31
+ no TensorFlow dependency chain.
32
+
33
+ ## Architecture
34
+
35
+ CNN spectrogram encoder + multi-head pitch/onset/note prediction. See the
36
+ [ICASSP 2022 paper](https://arxiv.org/abs/2203.09893) for details.
37
+
38
+ ## License
39
+
40
+ **Apache-2.0** — commercial-use OK.
41
+
42
+ ## Usage in MAESTRO
43
+
44
+ Loaded by `backend/ai/models/basic_pitch.py` via `onnxruntime`; surfaced
45
+ in the AI Workstation's `TranscribePanel` under General / Drums / Vocals
46
+ mode tabs.
47
+
48
+ ## Citation
49
+
50
+ ```
51
+ @inproceedings{2022_BittnerBRME_LightweightNoteTranscription_ICASSP,
52
+ title={A lightweight instrument-agnostic model for polyphonic note transcription and multipitch estimation},
53
+ author={Bittner, Rachel M. and Bosch, Juan Jos{\'e} and Rubinstein, David and Meseguer-Brocal, Gabriel and Ewert, Sebastian},
54
+ booktitle={ICASSP 2022},
55
+ year={2022}
56
+ }
57
+ ```
nmp.onnx ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:2c3c1d144bfa61ad236e92e169c13535c880469a12a047d4e73451f2c059a0ec
3
+ size 230444