Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

novateur
/
WavTokenizer-medium-speech-75token

Model card Files Files and versions
xet
Community
1
  • WavTokenizer

    WavTokenizer

    SOTA Discrete Codec Models With Forty Tokens Per Second for Audio Language Modeling

    arXiv demo github

    Downloads last month

    -

    Downloads are not tracked for this model. How to track
    Inference Providers NEW
    This model isn't deployed by any Inference Provider. πŸ™‹ Ask for provider support

    Spaces using novateur/WavTokenizer-medium-speech-75token 8

    πŸš€
    ccibeekeoc42/Aware-Demo
    πŸš€
    okewunmi/tts
    🏒
    Hameed13/my_news_podcast
    πŸŽ™οΈ
    Hameed13/Huggingface_News_Podcast
    ⚑
    rafibra93/hq_tts_libingo
    πŸ₯
    Emmanuelah/ai-healthmate-voice
    πŸ—£οΈ
    Emmanuelah/ai-healthmate-voice-api
    πŸ—£οΈ
    Codentia/ai-healthmate-voice

    Collection including novateur/WavTokenizer-medium-speech-75token

    WavTokenizer-Medium-Large

    Collection
    https://arxiv.org/abs/2408.16532 β€’ 4 items β€’ Updated Feb 25, 2025 β€’ 12

    Paper for novateur/WavTokenizer-medium-speech-75token

    WavTokenizer: an Efficient Acoustic Discrete Codec Tokenizer for Audio Language Modeling

    Paper β€’ 2408.16532 β€’ Published Aug 29, 2024 β€’ 50
    Company
    TOS Privacy About Careers
    Website
    Models Datasets Spaces Pricing Docs