Use from the Diffusers library
pip install -U diffusers transformers accelerate
import torch
from diffusers import DiffusionPipeline

# switch to "mps" for Apple devices
pipe = DiffusionPipeline.from_pretrained("lichang0928/QA-MDT", dtype=torch.bfloat16, device_map="cuda")

prompt = "A lively orchestral piece with soaring strings and a triumphant brass melody"
# audio diffusion pipelines expose the generated waveform via .audios rather than .images
audio = pipe(prompt).audios[0]
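
To keep the result, the waveform can be written to a WAV file. A minimal sketch, assuming the pipeline returns a NumPy waveform and a 16 kHz sample rate (a common rate for comparable latent audio diffusion models; check the model repository for the exact value):

import scipy.io.wavfile

# 16 kHz is an assumption based on comparable audio diffusion pipelines;
# verify the actual sample rate in the model repository
scipy.io.wavfile.write("qa_mdt_sample.wav", rate=16000, data=audio)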

Model Description

QA-MDT is a text-to-music generation model that is straightforward to set up and use. It incorporates a quality-aware training strategy to improve the fidelity of the generated music.
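
As a rough, hypothetical illustration of the quality-aware idea (not the official implementation; see the paper and repository for the actual method), training captions can be tagged according to an estimated quality score, so that prompts carrying the high-quality tag steer inference toward cleaner audio:

def tag_caption_with_quality(caption: str, pseudo_mos: float) -> str:
    # hypothetical thresholds mapping an estimated quality score to a coarse text tag
    if pseudo_mos >= 4.0:
        tag = "high quality"
    elif pseudo_mos >= 3.0:
        tag = "medium quality"
    else:
        tag = "low quality"
    return f"{tag}, {caption}"

# during training, each clip's caption would carry its quality tag;
# at inference, prompts are prefixed with "high quality" to request cleaner outputs
print(tag_caption_with_quality("a gentle acoustic guitar ballad", 4.3))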

How to Use

A Hugging Face Diffusers implementation is available in this model repository and in an accompanying Space. For more detailed instructions and the official PyTorch implementation, please refer to the project's GitHub repository and project page.

The model was presented in the paper QA-MDT: Quality-aware Masked Diffusion Transformer for Enhanced Music Generation.
