moonshotai/Kimi-K2.5
Image-Text-to-Text • 1.1T • Updated • 1.37M • • 2.81k
The complete integration stack: top LLMs, VLMs, image/video gen, TTS/ASR, embeddings, coding models, and agentic tools. Powered by HuggingFace Inferen
Generate custom speech from text, voice descriptions, or samples
Transform image viewpoint with adjustable camera angles