Improve model card for LeVo: High-Quality Song Generation with `text-to-audio` pipeline tag and detailed usage

by nielsr HF Staff - opened Oct 24, 2025

←

This PR significantly enhances the model card for LeVo: High-Quality Song Generation with Multi-Preference Alignment by:

Adding the pipeline_tag: text-to-audio to improve discoverability on the Hugging Face Hub.
Providing a comprehensive description of the model, including its abstract and key features from the paper and GitHub README.
Including direct links to the official paper, project page, GitHub repository, and Hugging Face Space demo.
Adding detailed installation instructions and usage examples with code snippets extracted directly from the GitHub README, focusing on the generate.sh script and input formats.
Including information about the Gradio UI and evaluation performance tables.

This update ensures that users can easily understand the model's capabilities and quickly get started with its usage.

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

Ready to merge

This branch is ready to get merged automatically.

· Sign up or log in to comment