Add ACIDS public catalog (10 models, ~1.6 GB)
Browse filesMirrors the canonical anonymous-download set from
https://play.forum.ircam.fr/rave-vst-api/get_available_models:
VCTK, darbouka_onnx, nasa, percussion, vintage, isis, musicnet,
sol_ordinario, sol_full, sol_ordinario_fast.
Each .ts ships a matching .json sidecar (same schema as the IIL set).
README rewritten to cover both the IIL-curated mirror and this new set.
All CC-BY-NC-4.0 inherited from upstream ACIDS.
- .gitattributes +10 -0
- README.md +112 -9
- VCTK.json +12 -0
- VCTK.ts +3 -0
- darbouka_onnx.json +12 -0
- darbouka_onnx.ts +3 -0
- isis.json +12 -0
- isis.ts +3 -0
- musicnet.json +12 -0
- musicnet.ts +3 -0
- nasa.json +12 -0
- nasa.ts +3 -0
- percussion.json +12 -0
- percussion.ts +3 -0
- sol_full.json +12 -0
- sol_full.ts +3 -0
- sol_ordinario.json +12 -0
- sol_ordinario.ts +3 -0
- sol_ordinario_fast.json +12 -0
- sol_ordinario_fast.ts +3 -0
- vintage.json +12 -0
- vintage.ts +3 -0
.gitattributes
CHANGED
|
@@ -51,3 +51,13 @@ voice-multi-b2048-r48000-z11.ts filter=lfs diff=lfs merge=lfs -text
|
|
| 51 |
voice_hifitts_b2048_r48000_z16.ts filter=lfs diff=lfs merge=lfs -text
|
| 52 |
voice_jvs_b2048_r44100_z16.ts filter=lfs diff=lfs merge=lfs -text
|
| 53 |
voice_vctk_b2048_r44100_z22.ts filter=lfs diff=lfs merge=lfs -text
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 51 |
voice_hifitts_b2048_r48000_z16.ts filter=lfs diff=lfs merge=lfs -text
|
| 52 |
voice_jvs_b2048_r44100_z16.ts filter=lfs diff=lfs merge=lfs -text
|
| 53 |
voice_vctk_b2048_r44100_z22.ts filter=lfs diff=lfs merge=lfs -text
|
| 54 |
+
VCTK.ts filter=lfs diff=lfs merge=lfs -text
|
| 55 |
+
darbouka_onnx.ts filter=lfs diff=lfs merge=lfs -text
|
| 56 |
+
isis.ts filter=lfs diff=lfs merge=lfs -text
|
| 57 |
+
musicnet.ts filter=lfs diff=lfs merge=lfs -text
|
| 58 |
+
nasa.ts filter=lfs diff=lfs merge=lfs -text
|
| 59 |
+
percussion.ts filter=lfs diff=lfs merge=lfs -text
|
| 60 |
+
sol_full.ts filter=lfs diff=lfs merge=lfs -text
|
| 61 |
+
sol_ordinario.ts filter=lfs diff=lfs merge=lfs -text
|
| 62 |
+
sol_ordinario_fast.ts filter=lfs diff=lfs merge=lfs -text
|
| 63 |
+
vintage.ts filter=lfs diff=lfs merge=lfs -text
|
README.md
CHANGED
|
@@ -1,28 +1,131 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
# RAVE — AEmotionStudio mirror
|
| 2 |
|
| 3 |
-
Curated mirror of
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 4 |
|
| 5 |
-
|
| 6 |
|
| 7 |
-
##
|
| 8 |
|
|
|
|
|
|
|
|
|
|
|
|
|
| 9 |
- `voice_vocalset_b2048_r48000_z16.ts` — **Voice (VocalSet)**. Voice timbre trained on the VocalSet corpus — covers vocal techniques across multiple singers. Use for the canonical 'make this sound like a voice' transfer.
|
| 10 |
- `voice-multi-b2048-r48000-z11.ts` — **Voice (Multi-speaker)**. Aggregated multi-speaker voice corpus. Wider speaker diversity than VocalSet — produces more 'average human' renders.
|
| 11 |
-
- `voice_hifitts_b2048_r48000_z16.ts` — **Voice (HiFi-TTS)**.
|
| 12 |
-
- `voice_jvs_b2048_r44100_z16.ts` — **Voice (JVS, Japanese)**. JVS
|
| 13 |
-
- `voice_vctk_b2048_r44100_z22.ts` — **Voice (VCTK, English)**. VCTK
|
|
|
|
|
|
|
| 14 |
- `birds_motherbird_b2048_r48000_z16.ts` — **Birds (Motherbird)**. Bird-vocalization corpus — chirps + textural transients. The canonical 'weird' pick: produces wildly warped output for any arbitrary input.
|
| 15 |
- `birds_dawnchorus_b2048_r48000_z8.ts` — **Birds (Dawn Chorus)**. Dense overlapping bird vocalizations recorded at dawn. Smaller 8-dim latent — outputs lean ensemble-textural over individual calls.
|
| 16 |
- `birds_pluma_b2048_r48000_z12.ts` — **Birds (Pluma)**. Lighter, individual bird-call timbres. Mid-size 12-dim latent balances character + clarity.
|
| 17 |
- `humpbacks_pondbrain_b2048_r48000_z20.ts` — **Humpback Whales**. Humpback-whale song. Long, slow, hauntingly-deep vocal contours — pairs well with sustained input.
|
| 18 |
- `marinemammals_pondbrain_b2048_r48000_z20.ts` — **Marine Mammals**. Mixed marine-mammal vocalizations — dolphins, orcas, sea-life clicks and cries.
|
|
|
|
|
|
|
| 19 |
- `guitar_iil_b2048_r48000_z16.ts` — **Guitar (IIL)**. Acoustic / electric guitar timbre. Good demo for transferring voice or synth input into a plucked-string voice.
|
| 20 |
- `organ_bach_b2048_r48000_z16.ts` — **Organ (Bach)**. Pipe-organ timbre trained on Bach repertoire. Sustained harmonic textures — pairs well with melodic input.
|
| 21 |
- `organ_archive_b2048_r48000_z16.ts` — **Organ (Archive)**. Historical pipe-organ recordings — broader, dustier textures than the Bach model. Good for film-score atmospheres.
|
| 22 |
- `sax_soprano_franziskaschroeder_b2048_r48000_z20.ts` — **Soprano Sax (Schroeder)**. Soprano-saxophone extended techniques by Franziska Schroeder. Multiphonics, growls, key clicks. 20-dim latent — captures fine-grained articulation.
|
|
|
|
|
|
|
|
|
|
|
|
|
| 23 |
- `water_pondbrain_b2048_r48000_z16.ts` — **Water (PondBrain)**. Water / aquatic textures. Treats any input as if it were running through liquid — bubbles, ripples, splashes.
|
| 24 |
- `magnets_b2048_r48000_z8.ts` — **Magnets**. Ferromagnetic / electromagnetic resonance textures — metallic hums, distant industrial buzz, magnetized-string ringing.
|
| 25 |
-
- `mrp_strengjavera_b2048_r44100_z16.ts` — **Magnetic Resonator Piano (Strengjavera)**. Magnetic Resonator Piano. Sustained metallic-string overtones produced by electromagnetically driving piano strings — 44.1 kHz.
|
| 26 |
-
- `crozzoli_bigensemblesmusic_18d.ts` — **Big Ensemble Music (Crozzoli)**. Big-ensemble orchestral music (M. Crozzoli). Broad 18-dim latent for hugely-textured renders. Sample rate not embedded in filename — defaults to 48000; override via panel if needed.
|
| 27 |
|
| 28 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
---
|
| 2 |
+
license: cc-by-nc-4.0
|
| 3 |
+
tags:
|
| 4 |
+
- audio
|
| 5 |
+
- rave
|
| 6 |
+
- timbre-transfer
|
| 7 |
+
- neural-synthesis
|
| 8 |
+
- ircam
|
| 9 |
+
- maestro
|
| 10 |
+
language:
|
| 11 |
+
- en
|
| 12 |
+
pipeline_tag: audio-to-audio
|
| 13 |
+
---
|
| 14 |
+
|
| 15 |
# RAVE — AEmotionStudio mirror
|
| 16 |
|
| 17 |
+
Curated mirror of public **RAVE** (Realtime Audio Variational autoEncoder) checkpoints, used
|
| 18 |
+
by MAESTRO's RAVE Timbre Transfer panel (opt-in starter pack). Sources:
|
| 19 |
+
|
| 20 |
+
- The [Intelligent-Instruments-Lab/rave-models](https://huggingface.co/Intelligent-Instruments-Lab/rave-models) curated set (birds, voices, organs, water, etc.).
|
| 21 |
+
- The [official ACIDS-IRCAM public catalog](https://acids-ircam.github.io/rave_models_download.html), pulled from the canonical anonymous API at `https://play.forum.ircam.fr/rave-vst-api/get_available_models`.
|
| 22 |
+
|
| 23 |
+
RAVE was developed by [Antoine Caillon](https://caillonantoine.github.io/) and the
|
| 24 |
+
[ACIDS team at IRCAM](https://www.ircam.fr/). Paper: [arXiv:2111.05011](https://arxiv.org/abs/2111.05011).
|
| 25 |
+
Upstream code: [acids-ircam/RAVE](https://github.com/acids-ircam/RAVE).
|
| 26 |
+
|
| 27 |
+
## License
|
| 28 |
+
|
| 29 |
+
**CC-BY-NC-4.0** — non-commercial use only, inherited from the upstream distributions.
|
| 30 |
+
Generated audio is fine for non-commercial use. Commercial use of the *models themselves*
|
| 31 |
+
(e.g. shipping them inside a paid product) requires permission from the original authors / IRCAM.
|
| 32 |
+
|
| 33 |
+
Per MAESTRO's stance (see `LICENSE_AUDIT.md` and the `feedback_download_on_demand_licensing`
|
| 34 |
+
memory), these weights are fetched *on demand* by the end user — the user (not MAESTRO the
|
| 35 |
+
binary) is the licensee.
|
| 36 |
|
| 37 |
+
---
|
| 38 |
|
| 39 |
+
## Models — IIL-curated set (b2048 streaming exports, 18 models)
|
| 40 |
|
| 41 |
+
Each `.ts` checkpoint has a `<stem>.json` sidecar with name, license, sample-rate, latent-dim,
|
| 42 |
+
source URL, and a one-line description.
|
| 43 |
+
|
| 44 |
+
### Voice / speech
|
| 45 |
- `voice_vocalset_b2048_r48000_z16.ts` — **Voice (VocalSet)**. Voice timbre trained on the VocalSet corpus — covers vocal techniques across multiple singers. Use for the canonical 'make this sound like a voice' transfer.
|
| 46 |
- `voice-multi-b2048-r48000-z11.ts` — **Voice (Multi-speaker)**. Aggregated multi-speaker voice corpus. Wider speaker diversity than VocalSet — produces more 'average human' renders.
|
| 47 |
+
- `voice_hifitts_b2048_r48000_z16.ts` — **Voice (HiFi-TTS)**. High-fidelity expressive English speech corpus. Cleaner, more articulate than the multi-speaker model.
|
| 48 |
+
- `voice_jvs_b2048_r44100_z16.ts` — **Voice (JVS, Japanese)**. JVS Japanese multi-speaker corpus at 44.1 kHz. Use for Japanese-language sources or non-Latin phoneme structure.
|
| 49 |
+
- `voice_vctk_b2048_r44100_z22.ts` — **Voice (VCTK, English)**. VCTK English multi-speaker corpus from CSTR Edinburgh, 44.1 kHz. High 22-dim latent — captures more speaker idiosyncrasies.
|
| 50 |
+
|
| 51 |
+
### Bird / wildlife
|
| 52 |
- `birds_motherbird_b2048_r48000_z16.ts` — **Birds (Motherbird)**. Bird-vocalization corpus — chirps + textural transients. The canonical 'weird' pick: produces wildly warped output for any arbitrary input.
|
| 53 |
- `birds_dawnchorus_b2048_r48000_z8.ts` — **Birds (Dawn Chorus)**. Dense overlapping bird vocalizations recorded at dawn. Smaller 8-dim latent — outputs lean ensemble-textural over individual calls.
|
| 54 |
- `birds_pluma_b2048_r48000_z12.ts` — **Birds (Pluma)**. Lighter, individual bird-call timbres. Mid-size 12-dim latent balances character + clarity.
|
| 55 |
- `humpbacks_pondbrain_b2048_r48000_z20.ts` — **Humpback Whales**. Humpback-whale song. Long, slow, hauntingly-deep vocal contours — pairs well with sustained input.
|
| 56 |
- `marinemammals_pondbrain_b2048_r48000_z20.ts` — **Marine Mammals**. Mixed marine-mammal vocalizations — dolphins, orcas, sea-life clicks and cries.
|
| 57 |
+
|
| 58 |
+
### Instruments
|
| 59 |
- `guitar_iil_b2048_r48000_z16.ts` — **Guitar (IIL)**. Acoustic / electric guitar timbre. Good demo for transferring voice or synth input into a plucked-string voice.
|
| 60 |
- `organ_bach_b2048_r48000_z16.ts` — **Organ (Bach)**. Pipe-organ timbre trained on Bach repertoire. Sustained harmonic textures — pairs well with melodic input.
|
| 61 |
- `organ_archive_b2048_r48000_z16.ts` — **Organ (Archive)**. Historical pipe-organ recordings — broader, dustier textures than the Bach model. Good for film-score atmospheres.
|
| 62 |
- `sax_soprano_franziskaschroeder_b2048_r48000_z20.ts` — **Soprano Sax (Schroeder)**. Soprano-saxophone extended techniques by Franziska Schroeder. Multiphonics, growls, key clicks. 20-dim latent — captures fine-grained articulation.
|
| 63 |
+
- `mrp_strengjavera_b2048_r44100_z16.ts` — **Magnetic Resonator Piano (Strengjavera)**. Sustained metallic-string overtones produced by electromagnetically driving piano strings — 44.1 kHz.
|
| 64 |
+
- `crozzoli_bigensemblesmusic_18d.ts` — **Big Ensemble Music (Crozzoli)**. Big-ensemble orchestral music (M. Crozzoli). Broad 18-dim latent for hugely-textured renders. Sample rate not embedded in filename — defaults to 48 kHz.
|
| 65 |
+
|
| 66 |
+
### Textures / environment
|
| 67 |
- `water_pondbrain_b2048_r48000_z16.ts` — **Water (PondBrain)**. Water / aquatic textures. Treats any input as if it were running through liquid — bubbles, ripples, splashes.
|
| 68 |
- `magnets_b2048_r48000_z8.ts` — **Magnets**. Ferromagnetic / electromagnetic resonance textures — metallic hums, distant industrial buzz, magnetized-string ringing.
|
|
|
|
|
|
|
| 69 |
|
| 70 |
+
---
|
| 71 |
+
|
| 72 |
+
## Models — ACIDS public catalog (10 models, mirrored 2026-05-18)
|
| 73 |
+
|
| 74 |
+
Pulled from the canonical anonymous-download endpoint `https://play.forum.ircam.fr/rave-vst-api/get_model/<slug>`.
|
| 75 |
+
Each `.ts` has a matching `<slug>.json` sidecar in the same schema as the IIL set.
|
| 76 |
+
|
| 77 |
+
| Slug | Display name | Type | Author | Year | Size | Prior |
|
| 78 |
+
|---|---|---|---|---|---|---|
|
| 79 |
+
| `VCTK` | VCTK (English Speech) | RAVE v1 (default) | Jb Dupuy | 2022 | 177 MB | ✓ |
|
| 80 |
+
| `darbouka_onnx` | Darbouka (Percussion) | RAVE v2 (ONNX) | Antoine Caillon | 2022 | 26 MB | – |
|
| 81 |
+
| `nasa` | NASA Apollo 11 | RAVE v1 (default) | Antoine Caillon | 2022 | 159 MB | ✓ |
|
| 82 |
+
| `percussion` | Percussion (Mixed) | RAVE v1 (default) | Antoine Caillon | 2022 | 71 MB | ✓ |
|
| 83 |
+
| `vintage` | Vintage Music | RAVE v1 (large) | Antoine Caillon | 2022 | 482 MB | ✓ |
|
| 84 |
+
| `isis` | ISiS (IRCAM Vocal DB) | RAVE v2 | A. Chemla–Romeu-Santos | 2023 | 149 MB | – |
|
| 85 |
+
| `musicnet` | MusicNet (Classical) | RAVE v2 | A. Chemla–Romeu-Santos | 2023 | 237 MB | ✓ |
|
| 86 |
+
| `sol_ordinario` | Studio OnLine (Ordinario) | RAVE v2 | A. Chemla–Romeu-Santos | 2023 | 149 MB | – |
|
| 87 |
+
| `sol_full` | Studio OnLine (Full) | RAVE v2 | A. Chemla–Romeu-Santos | 2023 | 149 MB | – |
|
| 88 |
+
| `sol_ordinario_fast` | Studio OnLine (Ordinario, fast) | RAVE v2 (small) | A. Chemla–Romeu-Santos | 2023 | 43 MB | – |
|
| 89 |
+
|
| 90 |
+
**ACIDS set total: ~1.6 GB across 10 models.**
|
| 91 |
+
|
| 92 |
+
> Note: `VCTK.ts` (ACIDS v1, 48 kHz, original 2022 release) and `voice_vctk_b2048_r44100_z22.ts`
|
| 93 |
+
> (IIL v2 retrain, 44.1 kHz) are *different* models trained on the same source corpus —
|
| 94 |
+
> keep both for comparison.
|
| 95 |
+
|
| 96 |
+
---
|
| 97 |
+
|
| 98 |
+
## File format
|
| 99 |
+
|
| 100 |
+
Each `*.ts` is a [TorchScript](https://pytorch.org/docs/stable/jit.html) export of the RAVE model,
|
| 101 |
+
streaming-mode (causal convolutions, cached state) — ready for realtime or offline inference.
|
| 102 |
+
|
| 103 |
+
```python
|
| 104 |
+
import torch
|
| 105 |
+
model = torch.jit.load("vintage.ts")
|
| 106 |
+
# Encode (B, 1, T) → latents
|
| 107 |
+
z = model.encode(audio)
|
| 108 |
+
# Decode latents → audio
|
| 109 |
+
y = model.decode(z)
|
| 110 |
+
```
|
| 111 |
+
|
| 112 |
+
Models with "Prior available" additionally ship a learned prior that can generate latents
|
| 113 |
+
autoregressively (see the [RAVE repo](https://github.com/acids-ircam/RAVE) for usage).
|
| 114 |
+
|
| 115 |
+
## Where to find more RAVE models
|
| 116 |
+
|
| 117 |
+
- [Neutone FX models](https://neutone.ai/fx/models) — community + curated `.nm` files (the Neutone wrapper format).
|
| 118 |
+
- [IRCAM Forum projects](https://forum.ircam.fr/) — individual user-submitted models; many require Forum account.
|
| 119 |
+
- [acids-ircam GitHub releases](https://github.com/acids-ircam/RAVE/releases) — reference checkpoints from the maintainers.
|
| 120 |
+
- [IRCAM RAVE Model Challenge 2025](https://forum.ircam.fr/collections/detail/rave-model-challenge-models/) — 11 prize-winner / submission models gated behind a Forum account.
|
| 121 |
+
|
| 122 |
+
## Citation
|
| 123 |
+
|
| 124 |
+
```bibtex
|
| 125 |
+
@inproceedings{caillon2021rave,
|
| 126 |
+
title={RAVE: A variational autoencoder for fast and high-quality neural audio synthesis},
|
| 127 |
+
author={Caillon, Antoine and Esling, Philippe},
|
| 128 |
+
booktitle={arXiv preprint arXiv:2111.05011},
|
| 129 |
+
year={2021}
|
| 130 |
+
}
|
| 131 |
+
```
|
VCTK.json
ADDED
|
@@ -0,0 +1,12 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
{
|
| 2 |
+
"name": "VCTK (English Speech)",
|
| 3 |
+
"license": "CC-BY-NC-4.0",
|
| 4 |
+
"sample_rate": 48000,
|
| 5 |
+
"latent_dim": null,
|
| 6 |
+
"source_url": "https://play.forum.ircam.fr/rave-vst-api/get_model/VCTK",
|
| 7 |
+
"description": "Trained on the VCTK speech corpus (CSTR Edinburgh) \u2014 multi-speaker English. Different from voice_vctk_b2048_r44100_z22.ts in this same repo (which is the IIL-curated v2 retrain); this is the original ACIDS v1 release.",
|
| 8 |
+
"author": "Jb Dupuy",
|
| 9 |
+
"release_date": "2022-05-11",
|
| 10 |
+
"model_type": "RAVE v1 (default)",
|
| 11 |
+
"prior_available": true
|
| 12 |
+
}
|
VCTK.ts
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:6d8c8cb4726a2660698de7b6311131975f9f2ddd5915113590b2ae2d9f9c4a38
|
| 3 |
+
size 177167756
|
darbouka_onnx.json
ADDED
|
@@ -0,0 +1,12 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
{
|
| 2 |
+
"name": "Darbouka (Percussion)",
|
| 3 |
+
"license": "CC-BY-NC-4.0",
|
| 4 |
+
"sample_rate": 48000,
|
| 5 |
+
"latent_dim": 4,
|
| 6 |
+
"source_url": "https://play.forum.ircam.fr/rave-vst-api/get_model/darbouka_onnx",
|
| 7 |
+
"description": "8 hours of darbouka (goblet drum) recordings \u2014 Middle-Eastern hand drum. ONNX-exported v2. Sharp transients + breathy decays. Good for re-skinning any percussive input into Mediterranean / North-African textures.",
|
| 8 |
+
"author": "Antoine Caillon",
|
| 9 |
+
"release_date": "2022-09-21",
|
| 10 |
+
"model_type": "RAVE v2 (ONNX)",
|
| 11 |
+
"prior_available": false
|
| 12 |
+
}
|
darbouka_onnx.ts
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:2631209242a90aaeca4c1c2985b1b104595c60713b44fef0a1c1e49b8e995f98
|
| 3 |
+
size 26350036
|
isis.json
ADDED
|
@@ -0,0 +1,12 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
{
|
| 2 |
+
"name": "ISiS (IRCAM Vocal Database)",
|
| 3 |
+
"license": "CC-BY-NC-4.0",
|
| 4 |
+
"sample_rate": 44100,
|
| 5 |
+
"latent_dim": 8,
|
| 6 |
+
"source_url": "https://play.forum.ircam.fr/rave-vst-api/get_model/isis",
|
| 7 |
+
"description": "Trained on the IRCAM ISiS vocal analysis-synthesis database. Operatic + extended-technique vocal timbres with rich formant structure. See forum.ircam.fr/projects/detail/isis/ for the source corpus.",
|
| 8 |
+
"author": "Axel Chemla\u2013Romeu-Santos",
|
| 9 |
+
"release_date": "2023-12-10",
|
| 10 |
+
"model_type": "RAVE v2",
|
| 11 |
+
"prior_available": false
|
| 12 |
+
}
|
isis.ts
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:16c5502112eb27cd9348690a776e1d60ee4f9002e2855d39354d75401b8c408b
|
| 3 |
+
size 148725070
|
musicnet.json
ADDED
|
@@ -0,0 +1,12 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
{
|
| 2 |
+
"name": "MusicNet (Classical Ensemble)",
|
| 3 |
+
"license": "CC-BY-NC-4.0",
|
| 4 |
+
"sample_rate": 44100,
|
| 5 |
+
"latent_dim": 16,
|
| 6 |
+
"source_url": "https://play.forum.ircam.fr/rave-vst-api/get_model/musicnet",
|
| 7 |
+
"description": "Trained on the MusicNet database (annotated classical chamber + orchestral recordings). Broad 16-dim latent captures multi-instrument timbral blends; good 'classical-ensemble' transfer target.",
|
| 8 |
+
"author": "Axel Chemla\u2013Romeu-Santos",
|
| 9 |
+
"release_date": "2023-12-10",
|
| 10 |
+
"model_type": "RAVE v2",
|
| 11 |
+
"prior_available": true
|
| 12 |
+
}
|
musicnet.ts
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:5228bc514c0a38280b774fd5a75ba5883e40013419a5acb0840ebd56d36e855b
|
| 3 |
+
size 236989418
|
nasa.json
ADDED
|
@@ -0,0 +1,12 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
{
|
| 2 |
+
"name": "NASA Apollo 11",
|
| 3 |
+
"license": "CC-BY-NC-4.0",
|
| 4 |
+
"sample_rate": 48000,
|
| 5 |
+
"latent_dim": 8,
|
| 6 |
+
"source_url": "https://play.forum.ircam.fr/rave-vst-api/get_model/nasa",
|
| 7 |
+
"description": "Trained on radio communications from the Apollo 11 mission. Distinctive comms-filtered voice + cosmic interference textures. Use any input \u2192 eerie spacefaring transmissions. Source: youtube.com/watch?v=DejhGSEu8wk",
|
| 8 |
+
"author": "Antoine Caillon",
|
| 9 |
+
"release_date": "2022-09-21",
|
| 10 |
+
"model_type": "RAVE v1 (default)",
|
| 11 |
+
"prior_available": true
|
| 12 |
+
}
|
nasa.ts
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:60b198ad1ae9450f16c00c6a27ca9332924eb15e57c1a1efb90909038dcde458
|
| 3 |
+
size 159284126
|
percussion.json
ADDED
|
@@ -0,0 +1,12 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
{
|
| 2 |
+
"name": "Percussion (Mixed)",
|
| 3 |
+
"license": "CC-BY-NC-4.0",
|
| 4 |
+
"sample_rate": 44100,
|
| 5 |
+
"latent_dim": null,
|
| 6 |
+
"source_url": "https://play.forum.ircam.fr/rave-vst-api/get_model/percussion",
|
| 7 |
+
"description": "8 hours of mixed percussion recordings \u2014 wide range of struck / hit / shaken sources. Good general-purpose 'turn this into rhythm' transfer; complements darbouka with broader timbral coverage.",
|
| 8 |
+
"author": "Antoine Caillon",
|
| 9 |
+
"release_date": "2022-09-21",
|
| 10 |
+
"model_type": "RAVE v1 (default)",
|
| 11 |
+
"prior_available": true
|
| 12 |
+
}
|
percussion.ts
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:12e044fa4cf0f461fa78dc5e108253d5ffaa395604a66aa43e3a9c3967dee0f9
|
| 3 |
+
size 71097032
|
sol_full.json
ADDED
|
@@ -0,0 +1,12 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
{
|
| 2 |
+
"name": "Studio OnLine (Full)",
|
| 3 |
+
"license": "CC-BY-NC-4.0",
|
| 4 |
+
"sample_rate": 44100,
|
| 5 |
+
"latent_dim": 8,
|
| 6 |
+
"source_url": "https://play.forum.ircam.fr/rave-vst-api/get_model/sol_full",
|
| 7 |
+
"description": "Full Studio OnLine database \u2014 ordinario PLUS extended techniques across every catalogued instrument. Bigger sonic palette than sol_ordinario at the cost of less consistency.",
|
| 8 |
+
"author": "Axel Chemla\u2013Romeu-Santos",
|
| 9 |
+
"release_date": "2023-12-10",
|
| 10 |
+
"model_type": "RAVE v2",
|
| 11 |
+
"prior_available": false
|
| 12 |
+
}
|
sol_full.ts
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:dba344fbd634937ed72779264d6d30fb85b9e125e7e1ab687c3c8145902d4a94
|
| 3 |
+
size 148724592
|
sol_ordinario.json
ADDED
|
@@ -0,0 +1,12 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
{
|
| 2 |
+
"name": "Studio OnLine (Ordinario)",
|
| 3 |
+
"license": "CC-BY-NC-4.0",
|
| 4 |
+
"sample_rate": 44100,
|
| 5 |
+
"latent_dim": 4,
|
| 6 |
+
"source_url": "https://play.forum.ircam.fr/rave-vst-api/get_model/sol_ordinario",
|
| 7 |
+
"description": "Ordinario (standard playing-technique) recordings from IRCAM's Studio OnLine database \u2014 strings, winds, brass, percussion in conventional articulations. Compact 4-dim latent; very clean re-voicing.",
|
| 8 |
+
"author": "Axel Chemla\u2013Romeu-Santos",
|
| 9 |
+
"release_date": "2023-12-10",
|
| 10 |
+
"model_type": "RAVE v2",
|
| 11 |
+
"prior_available": false
|
| 12 |
+
}
|
sol_ordinario.ts
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:84a620c61b8d3a28ee35c4abdc796a53bdd17c97adf81e3e4c7243cf8894a3ac
|
| 3 |
+
size 148733766
|
sol_ordinario_fast.json
ADDED
|
@@ -0,0 +1,12 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
{
|
| 2 |
+
"name": "Studio OnLine (Ordinario, fast)",
|
| 3 |
+
"license": "CC-BY-NC-4.0",
|
| 4 |
+
"sample_rate": 44100,
|
| 5 |
+
"latent_dim": 8,
|
| 6 |
+
"source_url": "https://play.forum.ircam.fr/rave-vst-api/get_model/sol_ordinario_fast",
|
| 7 |
+
"description": "Smaller / faster variant of sol_ordinario (43 MB vs 149 MB). Lower-fidelity rendering but much cheaper on CPU. Good default starter pick for live exploration before committing to the larger ordinario model.",
|
| 8 |
+
"author": "Axel Chemla\u2013Romeu-Santos",
|
| 9 |
+
"release_date": "2023-10-10",
|
| 10 |
+
"model_type": "RAVE v2 (small)",
|
| 11 |
+
"prior_available": false
|
| 12 |
+
}
|
sol_ordinario_fast.ts
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:9a8d32002efd6f385b521d64691e0d2d90d691ca4c50e85149ea88c6be399793
|
| 3 |
+
size 43058211
|
vintage.json
ADDED
|
@@ -0,0 +1,12 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
{
|
| 2 |
+
"name": "Vintage Music",
|
| 3 |
+
"license": "CC-BY-NC-4.0",
|
| 4 |
+
"sample_rate": 48000,
|
| 5 |
+
"latent_dim": null,
|
| 6 |
+
"source_url": "https://play.forum.ircam.fr/rave-vst-api/get_model/vintage",
|
| 7 |
+
"description": "80 hours of vintage-music recordings \u2014 the canonical 'large' ACIDS demo checkpoint (482 MB). Lush, lo-fi, archival-feeling outputs. Slowest of the set on CPU but the most musically-coherent timbre transfers.",
|
| 8 |
+
"author": "Antoine Caillon",
|
| 9 |
+
"release_date": "2022-09-21",
|
| 10 |
+
"model_type": "RAVE v1 (large)",
|
| 11 |
+
"prior_available": true
|
| 12 |
+
}
|
vintage.ts
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:ba4bb7e281658f2de0e97d47ecbdb7793169804bd1833859619f528b128ebf15
|
| 3 |
+
size 481781124
|