---
title: Oxlo.ai
emoji: ⚡
colorFrom: green
colorTo: blue
sdk: static
pinned: false
---
# Oxlo.ai - Flat Monthly Pricing for AI Inference
## What is Oxlo.ai?
Oxlo.ai is a **developer-first AI inference platform** with **request-based pricing**.
Unlike token-based providers, we charge a flat fee per API call - a 100-token prompt
costs the same as a 50,000-token prompt.
## 💡 Why Oxlo.ai?
| Feature | Oxlo.ai | Token-Based Providers |
|---------|---------|----------------------|
| Pricing model | **Per request** (flat) | Per token (variable) |
| Cost predictability | ✅ Fixed monthly bill | ❌ Scales with usage |
| Long-context cost | Same as short context | 10-100x more expensive |
| OpenAI SDK compatible | ✅ Drop-in replacement | Varies |
## 📦 Available Models (35+)
### Text & Chat
- Qwen 3 32B, Llama 3.3 70B, DeepSeek R1 671B, DeepSeek V3.2
- Llama 4 Maverick 17B, GPT-OSS 120B, Mistral 7B, Gemma 3
### Code
- Qwen 3 Coder 30B, DeepSeek Coder 33B
### Vision
- Gemma 3 27B, Kimi K2.5, Kimi K2 Thinking
### Image Generation
- Oxlo Image Pro, SDXL Lightning, Stable Diffusion 1.5, Flux 1.5
### Audio
- Whisper Large v3, Whisper Turbo, Kokoro 82M TTS
### Embeddings & Detection
- BGE-Large, E5-Large, YOLOv9, YOLOv11
## 🔗 Quick Start
```python
from openai import OpenAI
client = OpenAI(
base_url="https://api.oxlo.ai/v1",
api_key="your-oxlo-api-key"
)
response = client.chat.completions.create(
model="qwen-3-32b",
messages=[{"role": "user", "content": "Hello!"}]
)
print(response.choices[0].message.content)
```
## 🔗 Links
- [Website](https://www.oxlo.ai/)
- [Documentation](https://docs.oxlo.ai/docs/)
- [Get Started](https://portal.oxlo.ai)
- [GitHub](https://github.com/Cyborg-Network)
- Contact: [hello@oxlo.ai](mailto:hello@oxlo.ai)