Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
papple23g 's Collections
V2V
V2Model3D
A2T
T2T (LLM)
T2I
T2V
I2T (LLM-vision)
I2I
I2Model3D
I2V
STT
TTS / Voice Clone
T2A
A2A (Audio)
Text Embedding / Multimodal Embedding

A2T

updated Mar 3, 2025
Upvote
-

  • Running
    Featured
    61

    SoundwaveDemo

    📉
    61

    Process audio and generate text output based on instructions

    Note TTS、辨識語言.聲音.演講者的性別、總結說話內容

Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs