Improve model card for DMOSpeech 2: Add pipeline tag and detailed usage

by nielsr HF Staff - opened Jul 26, 2025


This PR improves the model card for DMOSpeech 2 by adding missing metadata and usage information.

Key updates include:

Adding the pipeline_tag: text-to-speech to ensure the model is properly categorized and discoverable on the Hugging Face Hub (e.g., at https://huggingface.co/models?pipeline_tag=text-to-speech).
Including the full paper abstract to provide a detailed overview of the model's capabilities and contributions.
Providing clear links to the official paper on Hugging Face, the project page, and the GitHub repository.
Adding detailed inference instructions, including environment setup and checkpoint download, and pointing to demo.ipynb in the official GitHub repository as the primary usage example instead of embedding a code snippet that could be incorrect.
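For readers unfamiliar with how the pipeline tag is stored: Hub model cards keep metadata in a YAML front-matter block, delimited by `---` lines, at the top of README.md. The sketch below is illustrative only and is not the code used in this PR. The helper name `set_pipeline_tag` and the `license: mit` line in the usage example are hypothetical, and the parsing is a simple line-based approach that assumes a flat front-matter block.

```python
def set_pipeline_tag(readme_text: str, tag: str = "text-to-speech") -> str:
    """Return readme_text with `pipeline_tag: <tag>` set in its YAML front matter.

    Hypothetical helper for illustration; assumes a flat, line-based front matter.
    """
    new_line = f"pipeline_tag: {tag}"
    lines = readme_text.split("\n")

    # Case 1: the card already opens with a front-matter block delimited by '---'.
    if lines and lines[0].strip() == "---":
        end = next(
            (i for i in range(1, len(lines)) if lines[i].strip() == "---"),
            None,
        )
        if end is not None:
            # Overwrite an existing pipeline_tag entry if there is one.
            for i in range(1, end):
                if lines[i].startswith("pipeline_tag:"):
                    lines[i] = new_line
                    return "\n".join(lines)
            # Otherwise add the entry just before the closing delimiter.
            lines.insert(end, new_line)
            return "\n".join(lines)

    # Case 2: no front matter yet, so prepend a new block.
    return "\n".join(["---", new_line, "---", ""]) + readme_text


if __name__ == "__main__":
    # 'license: mit' is a placeholder value, not taken from the actual model card.
    card = "---\nlicense: mit\n---\n# DMOSpeech 2\n"
    print(set_pipeline_tag(card))
```

Applying the helper twice leaves the card unchanged, since an existing `pipeline_tag` entry is overwritten rather than duplicated.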

This update should make the DMOSpeech 2 model card clearer and more useful for the community.


Ready to merge

This branch is ready to get merged automatically.
