RAVE_models / README.md
Tangible-Music-Lab's picture
Update README.md
3bf275f verified
---
license: mit
---
# RAVE Models by Tangible Music Lab
This is a collection of RAVE (Realtime Audio Variational autoEncoder) models trained by the Tangible Music Lab for audio generation and transformation. The aim of this repository is to provide musicians with pre-trained models for building embedded RAVE models on the Raspberry Pi platform or similar, for physical hardware and tangible interface development for sound and music experimentation. These models enable real-time audio manipulation and generation while being optimized for resource-constrained environments, making them ideal for interactive musical instruments and sound installations.
## Model Description
- **Developed by:** Tangible Music Lab
- **Model type:** RAVE (Realtime Audio Variational autoEncoder)
- **License:** MIT
## Model Sources
- **Repository:** https://huggingface.co/Tangible-Music-Lab/rave_models
- **Training Code:** https://github.com/victor-shepardson/RAVE
## Direct Use
These models are designed for real-time audio generation and transformation. They can be used with:
- nn~
- NN.ar
- rave-supercollider
### Models
#### tam_freesoundloop10k_default_b2048_r48000_z16.ts
- Dataset: Freesound Loop Dataset
- Model: RAVE v3 with default configuration
- Latent dimensions: 16
- Sample rate: 48kHz
#### tam_freesoundloop10k_raspi_b2048_r44100_z16.ts
- Dataset: Freesound Loop Dataset
- Model: Modified RAVE v3, optimized for Raspberry Pi 5
- Latent dimensions: 16
- Sample rate: 44.1kHz
- Special features: Scaled down for real-time performance on RPi 5
## Features
- All models are exported for streaming inference
- Compatible with nn~, NN.ar, and rave-supercollider
- Models focus on encoder-decoder architecture without prior networks
- Training checkpoints provided for transfer learning
- For training, use the Intelligent Instruments Lab RAVE fork: https://github.com/victor-shepardson/RAVE
## Training Details
### Training Data
The models were trained on the [Freesound Loop Dataset (FSL10K)](https://zenodo.org/records/3967852), a comprehensive collection of musical loops curated for machine learning applications. The dataset consists of 9,455 loops from Freesound.org.
All sounds in the dataset are licensed under various Creative Commons licenses.
### Training Procedure
Training checkpoints are provided for both models to enable transfer learning on custom datasets.
## Citation
```bibtex
@misc {tangible_music_lab_2025,
author = { {Tangible Music Lab} },
title = { RAVE Models },
year = 2025,
url = { https://huggingface.co/Tangible-Music-Lab/rave_models },
publisher = { Hugging Face }
}