— Inference Api —
📟
31
Generate text based on your input
Generate text based on your input
Chat with an image using Phi-3 Vision model
Meta Llama3 8b with Llava Multimodal capabilities
Decompose an image into editable layers
Multimodal OCR model for complex document understanding.
State-of-the-art OCR with 90+ language support
Generate images from text prompts
MiniCPM-V 4.6 Ultra-Efficient Multimodal AI
Chat with AI using text, images, and videos
Extract data from documents into JSON