---
title: Oxlo.ai
emoji: ⚡
colorFrom: green
colorTo: blue
sdk: static
pinned: false
---

# Oxlo.ai - Flat Monthly Pricing for AI Inference


## What is Oxlo.ai?

Oxlo.ai is a **developer-first AI inference platform** with **request-based pricing**. Unlike token-based providers, we charge a flat fee per API call - a 100-token prompt costs the same as a 50,000-token prompt.

## 💡 Why Oxlo.ai?

| Feature | Oxlo.ai | Token-Based Providers |
|---------|---------|----------------------|
| Pricing model | **Per request** (flat) | Per token (variable) |
| Cost predictability | ✅ Fixed monthly bill | ❌ Scales with usage |
| Long-context cost | Same as short context | 10-100x more expensive |
| OpenAI SDK compatible | ✅ Drop-in replacement | Varies |

## 📦 Available Models (35+)

### Text & Chat
- Qwen 3 32B, Llama 3.3 70B, DeepSeek R1 671B, DeepSeek V3.2
- Llama 4 Maverick 17B, GPT-OSS 120B, Mistral 7B, Gemma 3

### Code
- Qwen 3 Coder 30B, DeepSeek Coder 33B

### Vision
- Gemma 3 27B, Kimi K2.5, Kimi K2 Thinking

### Image Generation
- Oxlo Image Pro, SDXL Lightning, Stable Diffusion 1.5, Flux 1.5

### Audio
- Whisper Large v3, Whisper Turbo, Kokoro 82M TTS

### Embeddings & Detection
- BGE-Large, E5-Large, YOLOv9, YOLOv11

## 🔗 Quick Start

```python
from openai import OpenAI

client = OpenAI(
    base_url="https://api.oxlo.ai/v1",
    api_key="your-oxlo-api-key"
)

response = client.chat.completions.create(
    model="qwen-3-32b",
    messages=[{"role": "user", "content": "Hello!"}]
)

print(response.choices[0].message.content)
```

## 🔗 Links

- [Website](https://www.oxlo.ai/)
- [Documentation](https://docs.oxlo.ai/docs/)
- [Get Started](https://portal.oxlo.ai)
- [GitHub](https://github.com/Cyborg-Network)
- Contact: [hello@oxlo.ai](mailto:hello@oxlo.ai)
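
## Pricing Example

To make the per-request versus per-token comparison concrete, here is a minimal sketch of the arithmetic. The prices in `Pricing` are hypothetical placeholders chosen for illustration, not Oxlo.ai's or any provider's actual rates, and the helper names are likewise made up for this example. The prompt sizes (100 and 50,000 tokens) are the ones mentioned above.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Pricing:
    """Hypothetical prices for illustration only (not actual rates)."""
    per_request_usd: float = 0.002         # assumed flat fee per API call
    per_million_tokens_usd: float = 1.00   # assumed token-based rate


def per_request_cost(num_requests: int, pricing: Pricing) -> float:
    """Flat pricing: cost depends only on how many calls are made."""
    return num_requests * pricing.per_request_usd


def per_token_cost(prompt_tokens: list[int], pricing: Pricing) -> float:
    """Token pricing: cost grows with the total number of tokens sent."""
    return sum(prompt_tokens) / 1_000_000 * pricing.per_million_tokens_usd


pricing = Pricing()
short_prompts = [100] * 1000      # 1,000 calls with 100-token prompts
long_prompts = [50_000] * 1000    # 1,000 calls with 50,000-token prompts

for label, prompts in [("short", short_prompts), ("long", long_prompts)]:
    flat = per_request_cost(len(prompts), pricing)
    variable = per_token_cost(prompts, pricing)
    print(f"{label}: per-request ${flat:.2f}, per-token ${variable:.2f}")
```

Under these assumed numbers, the per-request total is identical for both workloads because the call count is the same, while the per-token total grows in proportion to prompt length. Substitute real rates from each provider's pricing page before drawing conclusions for your own workload.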