rhymes-ai/Aria
Image-Text-to-Text • 25B • Updated • 126k • 638
Generate captions using several AI models
Generate 3D room layout from an RGB panorama image
Generate images from sketches, edges, poses, and depth maps
Generate speaker‑labeled transcripts from video or audio