Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
papple23g
's Collections
V2V
V2Model3D
A2T
T2T (LLM)
T2I
T2V
I2T (LLM-vision)
I2I
I2Model3D
I2V
STT
TTS / Voice Clone
T2A
A2A (Audio)
Text Embedding / Multimodal Embedding
A2T
updated
Mar 3, 2025
Upvote
-
Running
Featured
61
SoundwaveDemo
📉
61
Process audio and generate text output based on instructions
Note
TTS、辨識語言.聲音.演講者的性別、總結說話內容
Upvote
-
Share collection
View history
Collection guide
Browse collections