Improve model card for DMOSpeech 2: Add pipeline tag and detailed usage

#2
by nielsr HF Staff - opened

This PR updates the model card for DMOSpeech 2 by adding missing metadata and fuller usage information.

Key updates include:

  • Adding the pipeline_tag: text-to-speech to ensure the model is properly categorized and discoverable on the Hugging Face Hub (e.g., at https://huggingface.co/models?pipeline_tag=text-to-speech).
  • Including the full paper abstract to provide a detailed overview of the model's capabilities and contributions.
  • Providing clear links to the official paper on Hugging Face, the project page, and the GitHub repository.
  • Adding inference instructions that cover environment setup and checkpoint download. For sample usage, the card points to demo.ipynb in the official GitHub repository instead of embedding a code snippet, which avoids publishing an incorrect one.
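
To illustrate the first item: the Hub reads model card metadata from the YAML block at the top of README.md, delimited by `---` lines. The sketch below is not part of this PR. It uses a placeholder card in which only the `pipeline_tag: text-to-speech` line comes from the change described above, and a hypothetical helper, `read_front_matter`, that handles only flat one-line `key: value` pairs. A real card should be parsed with a proper YAML library.

```python
# Hypothetical excerpt of a model card README.md. Only the pipeline_tag
# line is taken from this PR; the body is a placeholder.
CARD_TEXT = """\
---
pipeline_tag: text-to-speech
---

# DMOSpeech 2
"""


def read_front_matter(text: str) -> dict[str, str]:
    """Return the flat `key: value` pairs of the leading `---` block.

    Illustrative only: nested YAML, lists, and multi-line values are
    not supported. Returns an empty dict if the block is missing or
    never closed.
    """
    lines = text.splitlines()
    if not lines or lines[0].strip() != "---":
        return {}
    metadata: dict[str, str] = {}
    for line in lines[1:]:
        if line.strip() == "---":  # closing delimiter ends the block
            break
        key, sep, value = line.partition(":")
        if sep:
            metadata[key.strip()] = value.strip()
    else:
        # No closing delimiter was found, so treat the block as invalid.
        return {}
    return metadata


metadata = read_front_matter(CARD_TEXT)
print(metadata.get("pipeline_tag"))
```

A check like this could be run before merging to confirm that the tag is present and spelled exactly as the Hub expects.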

This update makes the DMOSpeech 2 model card clearer and more useful for the community.

