File size: 1,276 Bytes
a9d9323 12a0eab 71c4ce7 8d80363 71c4ce7 8d80363 71c4ce7 8d80363 71c4ce7 8d80363 71c4ce7 12a0eab 8d80363 71c4ce7 12a0eab 8d80363 12a0eab |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 |
---
license: mit
---
Selection of trained models for the RhythGen project: https://github.com/efraimdahl/RhythGen
# Models
• **LAS:** In-attention conditioning with syncopation labels on the Lieder dataset.
• **LMR2:** Attention modulation with spectral weight profiles profiles and a learnable scale (initialized at 10) on the Lieder dataset.
• **LB:** Baseline NotaGen (small) model without any conditioning, finetuned on the Lieder dataset.
• **HAS2:** In-attention conditioning with syncopation labels on the RAG-RH dataset.
• **RAS2:** In-attention conditioning with syncopation scores and voice masking on the RAG-
collection.
• **RB:** Baseline NotaGen model without any conditioning, finetuned on the RAG-collection.
# Base Model
All models are fintuned from [NotaGen small](https://huggingface.co/ElectricAlexis/NotaGen/blob/main/weights_notagen_pretrain_p_size_16_p_length_2048_p_layers_12_c_layers_3_h_size_768_lr_0.0002_batch_8.pth).
# Datasets
We fine-tuned NotaGen-small on a corpus of 1000-1500 pieces from either the [Lieder Dataset](https://github.com/OpenScore/Lieder) which is public or the [RAG Collection](https://dspace.library.uu.nl/bitstream/handle/1874/354841/OdekerkenVolkKoops2017.pdf?sequence=1) which is available on request. |