---
license: mit
---

Selection of trained models for the RhythGen project: https://github.com/efraimdahl/RhythGen

# Models

• **LAS:** In-attention conditioning with syncopation labels on the Lieder dataset.

• **LMR2:** Attention modulation with spectral weight profiles and a learnable scale (initialized at 10) on the Lieder dataset.

• **LB:** Baseline NotaGen (small) model without any conditioning, finetuned on the Lieder dataset.

• **HAS2:** In-attention conditioning with syncopation labels on the RAG-RH dataset.

• **RAS2:** In-attention conditioning with syncopation scores and voice masking on the RAG-collection.

• **RB:** Baseline NotaGen model without any conditioning, finetuned on the RAG-collection.


# Base Model
All models are fine-tuned from [NotaGen small](https://huggingface.co/ElectricAlexis/NotaGen/blob/main/weights_notagen_pretrain_p_size_16_p_length_2048_p_layers_12_c_layers_3_h_size_768_lr_0.0002_batch_8.pth).
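
The base checkpoint's filename encodes its hyperparameters as `key_value` pairs. As a minimal sketch, the snippet below reads them back out of the filename. The helper `parse_hparams` and its key list are illustrative assumptions, not part of NotaGen or RhythGen, and the keys are returned exactly as they appear in the filename without interpreting their meaning.

```python
import re

# Filename copied from the base-model link above.
CHECKPOINT = (
    "weights_notagen_pretrain_p_size_16_p_length_2048_p_layers_12_"
    "c_layers_3_h_size_768_lr_0.0002_batch_8.pth"
)

# Hypothetical helper: the key list is taken from this one filename only.
KEYS = ("p_size", "p_length", "p_layers", "c_layers", "h_size", "lr", "batch")


def parse_hparams(filename: str) -> dict:
    """Extract the key_value pairs embedded in a checkpoint filename."""
    pattern = r"(" + "|".join(KEYS) + r")_(\d+(?:\.\d+)?)"
    hparams = {}
    for key, raw in re.findall(pattern, filename):
        # Values containing a decimal point become floats, the rest ints.
        hparams[key] = float(raw) if "." in raw else int(raw)
    return hparams


if __name__ == "__main__":
    for key, value in parse_hparams(CHECKPOINT).items():
        print(f"{key}: {value}")
```

To fetch the file itself, one option is `hf_hub_download(repo_id="ElectricAlexis/NotaGen", filename=CHECKPOINT)` from the `huggingface_hub` package. How the weights are then loaded depends on the RhythGen code and is not shown here.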

# Datasets
We fine-tuned NotaGen-small on a corpus of 1000-1500 pieces from either the [Lieder Dataset](https://github.com/OpenScore/Lieder), which is public, or the [RAG Collection](https://dspace.library.uu.nl/bitstream/handle/1874/354841/OdekerkenVolkKoops2017.pdf?sequence=1), which is available on request.