File size: 2,666 Bytes
07ebf7a
 
 
75abbd1
 
 
 
 
07ebf7a
75abbd1
 
 
 
 
07ebf7a
75abbd1
fdb0cb1
75abbd1
 
 
07ebf7a
75abbd1
 
 
 
 
 
 
 
3bf275f
75abbd1
 
 
 
 
3bf275f
75abbd1
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
fdb0cb1
75abbd1
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
---
license: mit
---
# RAVE Models by Tangible Music Lab

This is a collection of RAVE (Realtime Audio Variational autoEncoder) models trained by the Tangible Music Lab for audio generation and transformation. The aim of this repository is to provide musicians with pre-trained models for building embedded RAVE models on the Raspberry Pi platform or similar, for physical hardware and tangible interface development for sound and music experimentation. These models enable real-time audio manipulation and generation while being optimized for resource-constrained environments, making them ideal for interactive musical instruments and sound installations.


## Model Description

- **Developed by:** Tangible Music Lab
- **Model type:** RAVE (Realtime Audio Variational autoEncoder) 
- **License:** MIT

## Model Sources

- **Repository:** https://huggingface.co/Tangible-Music-Lab/rave_models
- **Training Code:** https://github.com/victor-shepardson/RAVE


## Direct Use

These models are designed for real-time audio generation and transformation. They can be used with:
- nn~
- NN.ar
- rave-supercollider

### Models

#### tam_freesoundloop10k_default_b2048_r48000_z16.ts
- Dataset: Freesound Loop Dataset
- Model: RAVE v3 with default configuration 
- Latent dimensions: 16
- Sample rate: 48kHz

#### tam_freesoundloop10k_raspi_b2048_r44100_z16.ts
- Dataset: Freesound Loop Dataset
- Model: Modified RAVE v3, optimized for Raspberry Pi 5
- Latent dimensions: 16
- Sample rate: 44.1kHz
- Special features: Scaled down for real-time performance on RPi 5

## Features
- All models are exported for streaming inference
- Compatible with nn~, NN.ar, and rave-supercollider
- Models focus on encoder-decoder architecture without prior networks
- Training checkpoints provided for transfer learning 
- For training, use the Intelligent Instruments Lab RAVE fork: https://github.com/victor-shepardson/RAVE

## Training Details

### Training Data

The models were trained on the [Freesound Loop Dataset (FSL10K)](https://zenodo.org/records/3967852), a comprehensive collection of musical loops curated for machine learning applications. The dataset consists of 9,455 loops from Freesound.org.
All sounds in the dataset are licensed under various Creative Commons licenses.

### Training Procedure

Training checkpoints are provided for both models to enable transfer learning on custom datasets.

## Citation

```bibtex
@misc {tangible_music_lab_2025,
   author       = { {Tangible Music Lab} },
   title        = { RAVE Models },
   year         = 2025,
   url          = { https://huggingface.co/Tangible-Music-Lab/rave_models },
   publisher    = { Hugging Face }
}