IndexTeam/IndexTTS-2
Text-to-Speech • Updated • 14.5k • 740
Nanonets / olmOCR / RolmOCR / Aya-Vision / Qwen2-VL-OCR
Generate speech from text using a reference audio
Create a textured 3D model from a single image
Generate any application by Vibe Coding it
Generate 3D models from images
Scalable and Versatile 3D Generation from images
Generate and preview app code from a text description
F5-TTS & E2-TTS: Zero-Shot Voice Cloning (Unofficial Demo)
Compare two faces to verify identity