RhythGen / README.md
efraimdahl's picture
Update README.md
71c4ce7 verified
---
license: mit
---
Selection of trained models for the RhythGen project: https://github.com/efraimdahl/RhythGen
# Models
**LAS:** In-attention conditioning with syncopation labels on the Lieder dataset.
**LMR2:** Attention modulation with spectral weight profiles profiles and a learnable scale (initialized at 10) on the Lieder dataset.
**LB:** Baseline NotaGen (small) model without any conditioning, finetuned on the Lieder dataset.
**HAS2:** In-attention conditioning with syncopation labels on the RAG-RH dataset.
**RAS2:** In-attention conditioning with syncopation scores and voice masking on the RAG-
collection.
**RB:** Baseline NotaGen model without any conditioning, finetuned on the RAG-collection.
# Base Model
All models are fintuned from [NotaGen small](https://huggingface.co/ElectricAlexis/NotaGen/blob/main/weights_notagen_pretrain_p_size_16_p_length_2048_p_layers_12_c_layers_3_h_size_768_lr_0.0002_batch_8.pth).
# Datasets
We fine-tuned NotaGen-small on a corpus of 1000-1500 pieces from either the [Lieder Dataset](https://github.com/OpenScore/Lieder) which is public or the [RAG Collection](https://dspace.library.uu.nl/bitstream/handle/1874/354841/OdekerkenVolkKoops2017.pdf?sequence=1) which is available on request.