# Basic Pitch - Audio to MIDI

A Hugging Face Space for converting audio files to MIDI using Spotify's Basic Pitch model.

## Features

- **Automatic Music Transcription (AMT):** Converts audio to MIDI notation
- **Instrument Agnostic:** Works with vocals, strings, brass, woodwinds, etc.
- **CPU Optimized:** Lightweight model designed for CPU inference
- **Polyphonic Detection:** Detects multiple simultaneous notes
- **Easy to Use:** Simple Gradio web interface

## How to Use

1. Upload an audio file (`.wav`, `.mp3`, `.ogg`, `.flac`, `.m4a`)
2. Click "Transcribe to MIDI"
3. Download the resulting MIDI file

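
The same upload, transcribe, download flow can be sketched locally with the `basic-pitch` Python package. This is a minimal sketch, not necessarily how this Space is implemented: the helper names (`is_supported`, `midi_output_path`, `transcribe_to_midi`) are hypothetical, and it assumes `basic-pitch` is installed. Only `predict` from `basic_pitch.inference` comes from the library itself.

```python
from pathlib import Path

# Extensions listed in the upload step above.
SUPPORTED_EXTENSIONS = {".wav", ".mp3", ".ogg", ".flac", ".m4a"}


def is_supported(audio_path):
    """Return True if the file extension is one of the accepted formats."""
    return Path(audio_path).suffix.lower() in SUPPORTED_EXTENSIONS


def midi_output_path(audio_path, out_dir):
    """Build the output path: same base name as the input, `.mid` extension."""
    return Path(out_dir) / (Path(audio_path).stem + ".mid")


def transcribe_to_midi(audio_path, out_dir="."):
    """Transcribe one audio file and write the MIDI file next to `out_dir`."""
    if not is_supported(audio_path):
        raise ValueError(f"unsupported audio format: {audio_path}")

    # Imported lazily so the helpers above work without basic-pitch installed.
    from basic_pitch.inference import predict

    # predict() returns (model_output, midi_data, note_events);
    # midi_data is a pretty_midi.PrettyMIDI object.
    _model_output, midi_data, _note_events = predict(str(audio_path))

    out_path = midi_output_path(audio_path, out_dir)
    out_path.parent.mkdir(parents=True, exist_ok=True)
    midi_data.write(str(out_path))
    return out_path
```

Calling `transcribe_to_midi("take1.wav", "out")` would write `out/take1.mid`, assuming `take1.wav` exists and the package can load it.
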
## Model Information

- **Model:** Spotify Basic Pitch (ICASSP 2022 model)
- **Size:** ~20 MB
- **Inference Time:** ~1-2 seconds per minute of audio on CPU
- **Hardware:** No GPU required

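
As a rough worked example of the inference-time figure, the sketch below scales the quoted 1-2 seconds per minute of audio to a clip of a given length. The function name and its parameters are hypothetical, and the result is only as accurate as the quoted range; actual times depend on the CPU.

```python
def estimate_inference_seconds(audio_seconds, low_rate=1.0, high_rate=2.0):
    """Return a (low, high) estimate of CPU inference time in seconds.

    low_rate and high_rate are seconds of compute per minute of audio,
    defaulting to the 1-2 s/min range quoted above.
    """
    if audio_seconds < 0:
        raise ValueError("audio_seconds must be non-negative")
    minutes = audio_seconds / 60.0
    return (minutes * low_rate, minutes * high_rate)


# A 3.5-minute clip (210 s) is estimated at roughly 3.5 to 7 seconds.
low, high = estimate_inference_seconds(210)
```
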
## Best Practices

- Use mono audio where possible
- Avoid recordings with heavy background noise
- Transcribe one instrument at a time; the model works best on single-instrument recordings
- Use clear, high-quality recordings

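
One way to follow the mono recommendation is to average the channels of a stereo file before uploading it. The sketch below uses only the Python standard library and assumes a 16-bit PCM WAV input; the function names are hypothetical, and other formats would need a different loader.

```python
import array
import sys
import wave


def downmix_interleaved(samples, channels):
    """Average interleaved integer samples across channels, frame by frame.

    Uses floor division, so each output sample stays within the input range.
    """
    if channels < 1:
        raise ValueError("channels must be >= 1")
    if len(samples) % channels:
        raise ValueError("sample count is not a multiple of the channel count")
    return [
        sum(samples[i:i + channels]) // channels
        for i in range(0, len(samples), channels)
    ]


def wav_to_mono(src_path, dst_path):
    """Write a mono copy of a 16-bit PCM WAV file."""
    with wave.open(src_path, "rb") as src:
        if src.getsampwidth() != 2:
            raise ValueError("this sketch only handles 16-bit PCM")
        channels = src.getnchannels()
        rate = src.getframerate()
        frames = array.array("h", src.readframes(src.getnframes()))

    # WAV data is little-endian; swap on big-endian hosts before averaging.
    if sys.byteorder == "big":
        frames.byteswap()

    mono = array.array("h", downmix_interleaved(frames, channels))
    if sys.byteorder == "big":
        mono.byteswap()

    with wave.open(dst_path, "wb") as dst:
        dst.setnchannels(1)
        dst.setsampwidth(2)
        dst.setframerate(rate)
        dst.writeframes(mono.tobytes())
```

For example, `wav_to_mono("take1_stereo.wav", "take1_mono.wav")` would produce a single-channel file at the same sample rate, assuming the input file exists.
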
## References

- [GitHub Repository](https://github.com/spotify/basic-pitch)
- [Paper](https://arxiv.org/abs/2209.00799)