AEmotionStudio commited on
Commit
8f1acc9
·
verified ·
1 Parent(s): 4fe0f8f

Add ACIDS public catalog (10 models, ~1.6 GB)

Browse files

Mirrors the canonical anonymous-download set from
https://play.forum.ircam.fr/rave-vst-api/get_available_models:
VCTK, darbouka_onnx, nasa, percussion, vintage, isis, musicnet,
sol_ordinario, sol_full, sol_ordinario_fast.

Each .ts ships a matching .json sidecar (same schema as the IIL set).
README rewritten to cover both the IIL-curated mirror and this new set.
All CC-BY-NC-4.0 inherited from upstream ACIDS.

.gitattributes CHANGED
@@ -51,3 +51,13 @@ voice-multi-b2048-r48000-z11.ts filter=lfs diff=lfs merge=lfs -text
51
  voice_hifitts_b2048_r48000_z16.ts filter=lfs diff=lfs merge=lfs -text
52
  voice_jvs_b2048_r44100_z16.ts filter=lfs diff=lfs merge=lfs -text
53
  voice_vctk_b2048_r44100_z22.ts filter=lfs diff=lfs merge=lfs -text
 
 
 
 
 
 
 
 
 
 
 
51
  voice_hifitts_b2048_r48000_z16.ts filter=lfs diff=lfs merge=lfs -text
52
  voice_jvs_b2048_r44100_z16.ts filter=lfs diff=lfs merge=lfs -text
53
  voice_vctk_b2048_r44100_z22.ts filter=lfs diff=lfs merge=lfs -text
54
+ VCTK.ts filter=lfs diff=lfs merge=lfs -text
55
+ darbouka_onnx.ts filter=lfs diff=lfs merge=lfs -text
56
+ isis.ts filter=lfs diff=lfs merge=lfs -text
57
+ musicnet.ts filter=lfs diff=lfs merge=lfs -text
58
+ nasa.ts filter=lfs diff=lfs merge=lfs -text
59
+ percussion.ts filter=lfs diff=lfs merge=lfs -text
60
+ sol_full.ts filter=lfs diff=lfs merge=lfs -text
61
+ sol_ordinario.ts filter=lfs diff=lfs merge=lfs -text
62
+ sol_ordinario_fast.ts filter=lfs diff=lfs merge=lfs -text
63
+ vintage.ts filter=lfs diff=lfs merge=lfs -text
README.md CHANGED
@@ -1,28 +1,131 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
  # RAVE — AEmotionStudio mirror
2
 
3
- Curated mirror of [Intelligent-Instruments-Lab/rave-models](https://huggingface.co/Intelligent-Instruments-Lab/rave-models) RAVE checkpoints, used by MAESTRO's RAVE Timbre Transfer panel (opt-in starter pack).
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
4
 
5
- License: **CC-BY-NC-4.0** — non-commercial use only. See the upstream model card for full terms.
6
 
7
- ## Files
8
 
 
 
 
 
9
  - `voice_vocalset_b2048_r48000_z16.ts` — **Voice (VocalSet)**. Voice timbre trained on the VocalSet corpus — covers vocal techniques across multiple singers. Use for the canonical 'make this sound like a voice' transfer.
10
  - `voice-multi-b2048-r48000-z11.ts` — **Voice (Multi-speaker)**. Aggregated multi-speaker voice corpus. Wider speaker diversity than VocalSet — produces more 'average human' renders.
11
- - `voice_hifitts_b2048_r48000_z16.ts` — **Voice (HiFi-TTS)**. HiFi-TTS — high-fidelity expressive English speech corpus. Cleaner, more articulate than the multi-speaker model.
12
- - `voice_jvs_b2048_r44100_z16.ts` — **Voice (JVS, Japanese)**. JVS Japanese multi-speaker voice corpus at 44.1 kHz. Use for Japanese-language sources or non-Latin phoneme structure.
13
- - `voice_vctk_b2048_r44100_z22.ts` — **Voice (VCTK, English)**. VCTK English multi-speaker corpus from CSTR Edinburgh, 44.1 kHz. High 22-dim latent — captures more speaker idiosyncrasies.
 
 
14
  - `birds_motherbird_b2048_r48000_z16.ts` — **Birds (Motherbird)**. Bird-vocalization corpus — chirps + textural transients. The canonical 'weird' pick: produces wildly warped output for any arbitrary input.
15
  - `birds_dawnchorus_b2048_r48000_z8.ts` — **Birds (Dawn Chorus)**. Dense overlapping bird vocalizations recorded at dawn. Smaller 8-dim latent — outputs lean ensemble-textural over individual calls.
16
  - `birds_pluma_b2048_r48000_z12.ts` — **Birds (Pluma)**. Lighter, individual bird-call timbres. Mid-size 12-dim latent balances character + clarity.
17
  - `humpbacks_pondbrain_b2048_r48000_z20.ts` — **Humpback Whales**. Humpback-whale song. Long, slow, hauntingly-deep vocal contours — pairs well with sustained input.
18
  - `marinemammals_pondbrain_b2048_r48000_z20.ts` — **Marine Mammals**. Mixed marine-mammal vocalizations — dolphins, orcas, sea-life clicks and cries.
 
 
19
  - `guitar_iil_b2048_r48000_z16.ts` — **Guitar (IIL)**. Acoustic / electric guitar timbre. Good demo for transferring voice or synth input into a plucked-string voice.
20
  - `organ_bach_b2048_r48000_z16.ts` — **Organ (Bach)**. Pipe-organ timbre trained on Bach repertoire. Sustained harmonic textures — pairs well with melodic input.
21
  - `organ_archive_b2048_r48000_z16.ts` — **Organ (Archive)**. Historical pipe-organ recordings — broader, dustier textures than the Bach model. Good for film-score atmospheres.
22
  - `sax_soprano_franziskaschroeder_b2048_r48000_z20.ts` — **Soprano Sax (Schroeder)**. Soprano-saxophone extended techniques by Franziska Schroeder. Multiphonics, growls, key clicks. 20-dim latent — captures fine-grained articulation.
 
 
 
 
23
  - `water_pondbrain_b2048_r48000_z16.ts` — **Water (PondBrain)**. Water / aquatic textures. Treats any input as if it were running through liquid — bubbles, ripples, splashes.
24
  - `magnets_b2048_r48000_z8.ts` — **Magnets**. Ferromagnetic / electromagnetic resonance textures — metallic hums, distant industrial buzz, magnetized-string ringing.
25
- - `mrp_strengjavera_b2048_r44100_z16.ts` — **Magnetic Resonator Piano (Strengjavera)**. Magnetic Resonator Piano. Sustained metallic-string overtones produced by electromagnetically driving piano strings — 44.1 kHz.
26
- - `crozzoli_bigensemblesmusic_18d.ts` — **Big Ensemble Music (Crozzoli)**. Big-ensemble orchestral music (M. Crozzoli). Broad 18-dim latent for hugely-textured renders. Sample rate not embedded in filename — defaults to 48000; override via panel if needed.
27
 
28
- Each `.ts` checkpoint is accompanied by a `<stem>.json` sidecar with name, license, sample-rate, latent-dim, source URL, and a one-line description.
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: cc-by-nc-4.0
3
+ tags:
4
+ - audio
5
+ - rave
6
+ - timbre-transfer
7
+ - neural-synthesis
8
+ - ircam
9
+ - maestro
10
+ language:
11
+ - en
12
+ pipeline_tag: audio-to-audio
13
+ ---
14
+
15
  # RAVE — AEmotionStudio mirror
16
 
17
+ Curated mirror of public **RAVE** (Realtime Audio Variational autoEncoder) checkpoints, used
18
+ by MAESTRO's RAVE Timbre Transfer panel (opt-in starter pack). Sources:
19
+
20
+ - The [Intelligent-Instruments-Lab/rave-models](https://huggingface.co/Intelligent-Instruments-Lab/rave-models) curated set (birds, voices, organs, water, etc.).
21
+ - The [official ACIDS-IRCAM public catalog](https://acids-ircam.github.io/rave_models_download.html), pulled from the canonical anonymous API at `https://play.forum.ircam.fr/rave-vst-api/get_available_models`.
22
+
23
+ RAVE was developed by [Antoine Caillon](https://caillonantoine.github.io/) and the
24
+ [ACIDS team at IRCAM](https://www.ircam.fr/). Paper: [arXiv:2111.05011](https://arxiv.org/abs/2111.05011).
25
+ Upstream code: [acids-ircam/RAVE](https://github.com/acids-ircam/RAVE).
26
+
27
+ ## License
28
+
29
+ **CC-BY-NC-4.0** — non-commercial use only, inherited from the upstream distributions.
30
+ Generated audio is fine for non-commercial use. Commercial use of the *models themselves*
31
+ (e.g. shipping them inside a paid product) requires permission from the original authors / IRCAM.
32
+
33
+ Per MAESTRO's stance (see `LICENSE_AUDIT.md` and the `feedback_download_on_demand_licensing`
34
+ memory), these weights are fetched *on demand* by the end user — the user (not MAESTRO the
35
+ binary) is the licensee.
36
 
37
+ ---
38
 
39
+ ## Models — IIL-curated set (b2048 streaming exports, 18 models)
40
 
41
+ Each `.ts` checkpoint has a `<stem>.json` sidecar with name, license, sample-rate, latent-dim,
42
+ source URL, and a one-line description.
43
+
44
+ ### Voice / speech
45
  - `voice_vocalset_b2048_r48000_z16.ts` — **Voice (VocalSet)**. Voice timbre trained on the VocalSet corpus — covers vocal techniques across multiple singers. Use for the canonical 'make this sound like a voice' transfer.
46
  - `voice-multi-b2048-r48000-z11.ts` — **Voice (Multi-speaker)**. Aggregated multi-speaker voice corpus. Wider speaker diversity than VocalSet — produces more 'average human' renders.
47
+ - `voice_hifitts_b2048_r48000_z16.ts` — **Voice (HiFi-TTS)**. High-fidelity expressive English speech corpus. Cleaner, more articulate than the multi-speaker model.
48
+ - `voice_jvs_b2048_r44100_z16.ts` — **Voice (JVS, Japanese)**. JVS Japanese multi-speaker corpus at 44.1 kHz. Use for Japanese-language sources or non-Latin phoneme structure.
49
+ - `voice_vctk_b2048_r44100_z22.ts` — **Voice (VCTK, English)**. VCTK English multi-speaker corpus from CSTR Edinburgh, 44.1 kHz. High 22-dim latent — captures more speaker idiosyncrasies.
50
+
51
+ ### Bird / wildlife
52
  - `birds_motherbird_b2048_r48000_z16.ts` — **Birds (Motherbird)**. Bird-vocalization corpus — chirps + textural transients. The canonical 'weird' pick: produces wildly warped output for any arbitrary input.
53
  - `birds_dawnchorus_b2048_r48000_z8.ts` — **Birds (Dawn Chorus)**. Dense overlapping bird vocalizations recorded at dawn. Smaller 8-dim latent — outputs lean ensemble-textural over individual calls.
54
  - `birds_pluma_b2048_r48000_z12.ts` — **Birds (Pluma)**. Lighter, individual bird-call timbres. Mid-size 12-dim latent balances character + clarity.
55
  - `humpbacks_pondbrain_b2048_r48000_z20.ts` — **Humpback Whales**. Humpback-whale song. Long, slow, hauntingly-deep vocal contours — pairs well with sustained input.
56
  - `marinemammals_pondbrain_b2048_r48000_z20.ts` — **Marine Mammals**. Mixed marine-mammal vocalizations — dolphins, orcas, sea-life clicks and cries.
57
+
58
+ ### Instruments
59
  - `guitar_iil_b2048_r48000_z16.ts` — **Guitar (IIL)**. Acoustic / electric guitar timbre. Good demo for transferring voice or synth input into a plucked-string voice.
60
  - `organ_bach_b2048_r48000_z16.ts` — **Organ (Bach)**. Pipe-organ timbre trained on Bach repertoire. Sustained harmonic textures — pairs well with melodic input.
61
  - `organ_archive_b2048_r48000_z16.ts` — **Organ (Archive)**. Historical pipe-organ recordings — broader, dustier textures than the Bach model. Good for film-score atmospheres.
62
  - `sax_soprano_franziskaschroeder_b2048_r48000_z20.ts` — **Soprano Sax (Schroeder)**. Soprano-saxophone extended techniques by Franziska Schroeder. Multiphonics, growls, key clicks. 20-dim latent — captures fine-grained articulation.
63
+ - `mrp_strengjavera_b2048_r44100_z16.ts` — **Magnetic Resonator Piano (Strengjavera)**. Sustained metallic-string overtones produced by electromagnetically driving piano strings — 44.1 kHz.
64
+ - `crozzoli_bigensemblesmusic_18d.ts` — **Big Ensemble Music (Crozzoli)**. Big-ensemble orchestral music (M. Crozzoli). Broad 18-dim latent for hugely-textured renders. Sample rate not embedded in filename — defaults to 48 kHz.
65
+
66
+ ### Textures / environment
67
  - `water_pondbrain_b2048_r48000_z16.ts` — **Water (PondBrain)**. Water / aquatic textures. Treats any input as if it were running through liquid — bubbles, ripples, splashes.
68
  - `magnets_b2048_r48000_z8.ts` — **Magnets**. Ferromagnetic / electromagnetic resonance textures — metallic hums, distant industrial buzz, magnetized-string ringing.
 
 
69
 
70
+ ---
71
+
72
+ ## Models — ACIDS public catalog (10 models, mirrored 2026-05-18)
73
+
74
+ Pulled from the canonical anonymous-download endpoint `https://play.forum.ircam.fr/rave-vst-api/get_model/<slug>`.
75
+ Each `.ts` has a matching `<slug>.json` sidecar in the same schema as the IIL set.
76
+
77
+ | Slug | Display name | Type | Author | Year | Size | Prior |
78
+ |---|---|---|---|---|---|---|
79
+ | `VCTK` | VCTK (English Speech) | RAVE v1 (default) | Jb Dupuy | 2022 | 177 MB | ✓ |
80
+ | `darbouka_onnx` | Darbouka (Percussion) | RAVE v2 (ONNX) | Antoine Caillon | 2022 | 26 MB | – |
81
+ | `nasa` | NASA Apollo 11 | RAVE v1 (default) | Antoine Caillon | 2022 | 159 MB | ✓ |
82
+ | `percussion` | Percussion (Mixed) | RAVE v1 (default) | Antoine Caillon | 2022 | 71 MB | ✓ |
83
+ | `vintage` | Vintage Music | RAVE v1 (large) | Antoine Caillon | 2022 | 482 MB | ✓ |
84
+ | `isis` | ISiS (IRCAM Vocal DB) | RAVE v2 | A. Chemla–Romeu-Santos | 2023 | 149 MB | – |
85
+ | `musicnet` | MusicNet (Classical) | RAVE v2 | A. Chemla–Romeu-Santos | 2023 | 237 MB | ✓ |
86
+ | `sol_ordinario` | Studio OnLine (Ordinario) | RAVE v2 | A. Chemla–Romeu-Santos | 2023 | 149 MB | – |
87
+ | `sol_full` | Studio OnLine (Full) | RAVE v2 | A. Chemla–Romeu-Santos | 2023 | 149 MB | – |
88
+ | `sol_ordinario_fast` | Studio OnLine (Ordinario, fast) | RAVE v2 (small) | A. Chemla–Romeu-Santos | 2023 | 43 MB | – |
89
+
90
+ **ACIDS set total: ~1.6 GB across 10 models.**
91
+
92
+ > Note: `VCTK.ts` (ACIDS v1, 48 kHz, original 2022 release) and `voice_vctk_b2048_r44100_z22.ts`
93
+ > (IIL v2 retrain, 44.1 kHz) are *different* models trained on the same source corpus —
94
+ > keep both for comparison.
95
+
96
+ ---
97
+
98
+ ## File format
99
+
100
+ Each `*.ts` is a [TorchScript](https://pytorch.org/docs/stable/jit.html) export of the RAVE model,
101
+ streaming-mode (causal convolutions, cached state) — ready for realtime or offline inference.
102
+
103
+ ```python
104
+ import torch
105
+ model = torch.jit.load("vintage.ts")
106
+ # Encode (B, 1, T) → latents
107
+ z = model.encode(audio)
108
+ # Decode latents → audio
109
+ y = model.decode(z)
110
+ ```
111
+
112
+ Models with "Prior available" additionally ship a learned prior that can generate latents
113
+ autoregressively (see the [RAVE repo](https://github.com/acids-ircam/RAVE) for usage).
114
+
115
+ ## Where to find more RAVE models
116
+
117
+ - [Neutone FX models](https://neutone.ai/fx/models) — community + curated `.nm` files (the Neutone wrapper format).
118
+ - [IRCAM Forum projects](https://forum.ircam.fr/) — individual user-submitted models; many require Forum account.
119
+ - [acids-ircam GitHub releases](https://github.com/acids-ircam/RAVE/releases) — reference checkpoints from the maintainers.
120
+ - [IRCAM RAVE Model Challenge 2025](https://forum.ircam.fr/collections/detail/rave-model-challenge-models/) — 11 prize-winner / submission models gated behind a Forum account.
121
+
122
+ ## Citation
123
+
124
+ ```bibtex
125
+ @inproceedings{caillon2021rave,
126
+ title={RAVE: A variational autoencoder for fast and high-quality neural audio synthesis},
127
+ author={Caillon, Antoine and Esling, Philippe},
128
+ booktitle={arXiv preprint arXiv:2111.05011},
129
+ year={2021}
130
+ }
131
+ ```
VCTK.json ADDED
@@ -0,0 +1,12 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "name": "VCTK (English Speech)",
3
+ "license": "CC-BY-NC-4.0",
4
+ "sample_rate": 48000,
5
+ "latent_dim": null,
6
+ "source_url": "https://play.forum.ircam.fr/rave-vst-api/get_model/VCTK",
7
+ "description": "Trained on the VCTK speech corpus (CSTR Edinburgh) \u2014 multi-speaker English. Different from voice_vctk_b2048_r44100_z22.ts in this same repo (which is the IIL-curated v2 retrain); this is the original ACIDS v1 release.",
8
+ "author": "Jb Dupuy",
9
+ "release_date": "2022-05-11",
10
+ "model_type": "RAVE v1 (default)",
11
+ "prior_available": true
12
+ }
VCTK.ts ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:6d8c8cb4726a2660698de7b6311131975f9f2ddd5915113590b2ae2d9f9c4a38
3
+ size 177167756
darbouka_onnx.json ADDED
@@ -0,0 +1,12 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "name": "Darbouka (Percussion)",
3
+ "license": "CC-BY-NC-4.0",
4
+ "sample_rate": 48000,
5
+ "latent_dim": 4,
6
+ "source_url": "https://play.forum.ircam.fr/rave-vst-api/get_model/darbouka_onnx",
7
+ "description": "8 hours of darbouka (goblet drum) recordings \u2014 Middle-Eastern hand drum. ONNX-exported v2. Sharp transients + breathy decays. Good for re-skinning any percussive input into Mediterranean / North-African textures.",
8
+ "author": "Antoine Caillon",
9
+ "release_date": "2022-09-21",
10
+ "model_type": "RAVE v2 (ONNX)",
11
+ "prior_available": false
12
+ }
darbouka_onnx.ts ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:2631209242a90aaeca4c1c2985b1b104595c60713b44fef0a1c1e49b8e995f98
3
+ size 26350036
isis.json ADDED
@@ -0,0 +1,12 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "name": "ISiS (IRCAM Vocal Database)",
3
+ "license": "CC-BY-NC-4.0",
4
+ "sample_rate": 44100,
5
+ "latent_dim": 8,
6
+ "source_url": "https://play.forum.ircam.fr/rave-vst-api/get_model/isis",
7
+ "description": "Trained on the IRCAM ISiS vocal analysis-synthesis database. Operatic + extended-technique vocal timbres with rich formant structure. See forum.ircam.fr/projects/detail/isis/ for the source corpus.",
8
+ "author": "Axel Chemla\u2013Romeu-Santos",
9
+ "release_date": "2023-12-10",
10
+ "model_type": "RAVE v2",
11
+ "prior_available": false
12
+ }
isis.ts ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:16c5502112eb27cd9348690a776e1d60ee4f9002e2855d39354d75401b8c408b
3
+ size 148725070
musicnet.json ADDED
@@ -0,0 +1,12 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "name": "MusicNet (Classical Ensemble)",
3
+ "license": "CC-BY-NC-4.0",
4
+ "sample_rate": 44100,
5
+ "latent_dim": 16,
6
+ "source_url": "https://play.forum.ircam.fr/rave-vst-api/get_model/musicnet",
7
+ "description": "Trained on the MusicNet database (annotated classical chamber + orchestral recordings). Broad 16-dim latent captures multi-instrument timbral blends; good 'classical-ensemble' transfer target.",
8
+ "author": "Axel Chemla\u2013Romeu-Santos",
9
+ "release_date": "2023-12-10",
10
+ "model_type": "RAVE v2",
11
+ "prior_available": true
12
+ }
musicnet.ts ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:5228bc514c0a38280b774fd5a75ba5883e40013419a5acb0840ebd56d36e855b
3
+ size 236989418
nasa.json ADDED
@@ -0,0 +1,12 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "name": "NASA Apollo 11",
3
+ "license": "CC-BY-NC-4.0",
4
+ "sample_rate": 48000,
5
+ "latent_dim": 8,
6
+ "source_url": "https://play.forum.ircam.fr/rave-vst-api/get_model/nasa",
7
+ "description": "Trained on radio communications from the Apollo 11 mission. Distinctive comms-filtered voice + cosmic interference textures. Use any input \u2192 eerie spacefaring transmissions. Source: youtube.com/watch?v=DejhGSEu8wk",
8
+ "author": "Antoine Caillon",
9
+ "release_date": "2022-09-21",
10
+ "model_type": "RAVE v1 (default)",
11
+ "prior_available": true
12
+ }
nasa.ts ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:60b198ad1ae9450f16c00c6a27ca9332924eb15e57c1a1efb90909038dcde458
3
+ size 159284126
percussion.json ADDED
@@ -0,0 +1,12 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "name": "Percussion (Mixed)",
3
+ "license": "CC-BY-NC-4.0",
4
+ "sample_rate": 44100,
5
+ "latent_dim": null,
6
+ "source_url": "https://play.forum.ircam.fr/rave-vst-api/get_model/percussion",
7
+ "description": "8 hours of mixed percussion recordings \u2014 wide range of struck / hit / shaken sources. Good general-purpose 'turn this into rhythm' transfer; complements darbouka with broader timbral coverage.",
8
+ "author": "Antoine Caillon",
9
+ "release_date": "2022-09-21",
10
+ "model_type": "RAVE v1 (default)",
11
+ "prior_available": true
12
+ }
percussion.ts ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:12e044fa4cf0f461fa78dc5e108253d5ffaa395604a66aa43e3a9c3967dee0f9
3
+ size 71097032
sol_full.json ADDED
@@ -0,0 +1,12 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "name": "Studio OnLine (Full)",
3
+ "license": "CC-BY-NC-4.0",
4
+ "sample_rate": 44100,
5
+ "latent_dim": 8,
6
+ "source_url": "https://play.forum.ircam.fr/rave-vst-api/get_model/sol_full",
7
+ "description": "Full Studio OnLine database \u2014 ordinario PLUS extended techniques across every catalogued instrument. Bigger sonic palette than sol_ordinario at the cost of less consistency.",
8
+ "author": "Axel Chemla\u2013Romeu-Santos",
9
+ "release_date": "2023-12-10",
10
+ "model_type": "RAVE v2",
11
+ "prior_available": false
12
+ }
sol_full.ts ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:dba344fbd634937ed72779264d6d30fb85b9e125e7e1ab687c3c8145902d4a94
3
+ size 148724592
sol_ordinario.json ADDED
@@ -0,0 +1,12 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "name": "Studio OnLine (Ordinario)",
3
+ "license": "CC-BY-NC-4.0",
4
+ "sample_rate": 44100,
5
+ "latent_dim": 4,
6
+ "source_url": "https://play.forum.ircam.fr/rave-vst-api/get_model/sol_ordinario",
7
+ "description": "Ordinario (standard playing-technique) recordings from IRCAM's Studio OnLine database \u2014 strings, winds, brass, percussion in conventional articulations. Compact 4-dim latent; very clean re-voicing.",
8
+ "author": "Axel Chemla\u2013Romeu-Santos",
9
+ "release_date": "2023-12-10",
10
+ "model_type": "RAVE v2",
11
+ "prior_available": false
12
+ }
sol_ordinario.ts ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:84a620c61b8d3a28ee35c4abdc796a53bdd17c97adf81e3e4c7243cf8894a3ac
3
+ size 148733766
sol_ordinario_fast.json ADDED
@@ -0,0 +1,12 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "name": "Studio OnLine (Ordinario, fast)",
3
+ "license": "CC-BY-NC-4.0",
4
+ "sample_rate": 44100,
5
+ "latent_dim": 8,
6
+ "source_url": "https://play.forum.ircam.fr/rave-vst-api/get_model/sol_ordinario_fast",
7
+ "description": "Smaller / faster variant of sol_ordinario (43 MB vs 149 MB). Lower-fidelity rendering but much cheaper on CPU. Good default starter pick for live exploration before committing to the larger ordinario model.",
8
+ "author": "Axel Chemla\u2013Romeu-Santos",
9
+ "release_date": "2023-10-10",
10
+ "model_type": "RAVE v2 (small)",
11
+ "prior_available": false
12
+ }
sol_ordinario_fast.ts ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:9a8d32002efd6f385b521d64691e0d2d90d691ca4c50e85149ea88c6be399793
3
+ size 43058211
vintage.json ADDED
@@ -0,0 +1,12 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "name": "Vintage Music",
3
+ "license": "CC-BY-NC-4.0",
4
+ "sample_rate": 48000,
5
+ "latent_dim": null,
6
+ "source_url": "https://play.forum.ircam.fr/rave-vst-api/get_model/vintage",
7
+ "description": "80 hours of vintage-music recordings \u2014 the canonical 'large' ACIDS demo checkpoint (482 MB). Lush, lo-fi, archival-feeling outputs. Slowest of the set on CPU but the most musically-coherent timbre transfers.",
8
+ "author": "Antoine Caillon",
9
+ "release_date": "2022-09-21",
10
+ "model_type": "RAVE v1 (large)",
11
+ "prior_available": true
12
+ }
vintage.ts ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:ba4bb7e281658f2de0e97d47ecbdb7793169804bd1833859619f528b128ebf15
3
+ size 481781124