efraimdahl
/

RhythGen

Model card Files Files and versions

RhythGen / README.md

efraimdahl's picture

Update README.md

71c4ce7 verified 4 months ago

|

history blame contribute delete

1.28 kB

	---
	license: mit
	---

	Selection of trained models for the RhythGen project: https://github.com/efraimdahl/RhythGen

	# Models
	• LAS: In-attention conditioning with syncopation labels on the Lieder dataset.

	• LMR2: Attention modulation with spectral weight profiles profiles and a learnable scale (initialized at 10) on the Lieder dataset.

	• LB: Baseline NotaGen (small) model without any conditioning, finetuned on the Lieder dataset.

	• HAS2: In-attention conditioning with syncopation labels on the RAG-RH dataset.

	• RAS2: In-attention conditioning with syncopation scores and voice masking on the RAG-
	collection.

	• RB: Baseline NotaGen model without any conditioning, finetuned on the RAG-collection.


	# Base Model
	All models are fintuned from [NotaGen small](https://huggingface.co/ElectricAlexis/NotaGen/blob/main/weights_notagen_pretrain_p_size_16_p_length_2048_p_layers_12_c_layers_3_h_size_768_lr_0.0002_batch_8.pth).

	# Datasets
	We fine-tuned NotaGen-small on a corpus of 1000-1500 pieces from either the [Lieder Dataset](https://github.com/OpenScore/Lieder) which is public or the [RAG Collection](https://dspace.library.uu.nl/bitstream/handle/1874/354841/OdekerkenVolkKoops2017.pdf?sequence=1) which is available on request.