A2T - a papple23g Collection

papple23g 's Collections

I2T (LLM-vision)

TTS / Voice Clone

Text Embedding / Multimodal Embedding

A2T

updated Mar 3, 2025

Running

Featured

61

SoundwaveDemo

📉

61

Process audio and generate text output based on instructions

Note TTS、辨識語言.聲音.演講者的性別、總結說話內容