Spaces:

llzai
/

axonhub

Sleeping

App Files Files Community

axonhub / docs /en /api-reference /openai-api.md

llzai

Upload 1793 files

9853396 verified about 1 month ago

preview code

raw

history blame contribute delete

16.5 kB

	# OpenAI API Reference

	## Overview

	AxonHub provides full support for the OpenAI API specification, allowing you to use any OpenAI-compatible client SDK to access models from multiple providers.

	## Key Benefits

	- API Interoperability: Use OpenAI Chat Completions API to call Anthropic, Gemini, and other supported models
	- Zero Code Changes: Continue using your existing OpenAI client SDK without modification
	- Automatic Translation: AxonHub automatically converts between API formats when needed
	- Provider Flexibility: Access any supported AI provider using the OpenAI API format

	## Supported Endpoints

	### OpenAI Chat Completions API

	Endpoints:
	- `POST /v1/chat/completions` - Text generation
	- `GET /v1/models` - List available models

	Example Request:
	```go
	import (
	"github.com/openai/openai-go/v3"
	"github.com/openai/openai-go/v3/option"
	)

	// Create OpenAI client with AxonHub configuration
	client := openai.NewClient(
	option.WithAPIKey("your-axonhub-api-key"),
	option.WithBaseURL("http://localhost:8090/v1"),

	)

	// Call Anthropic model using OpenAI API format
	completion, err := client.Chat.Completions.New(ctx, openai.ChatCompletionNewParams{
	Messages: []openai.ChatCompletionMessageParamUnion{
	openai.UserMessage("Hello, Claude!"),
	},
	Model: openai.ChatModel("claude-3-5-sonnet"),
	},
	option.WithHeader("AH-Trace-Id", "trace-example-123"),
	option.WithHeader("AH-Thread-Id", "thread-example-abc"))
	if err != nil {
	// Handle error appropriately
	panic(err)
	}

	// Access the response content
	responseText := completion.Choices[0].Message.Content
	fmt.Println(responseText)
	```

	### OpenAI Responses API

	AxonHub provides partial support for the OpenAI Responses API. This API offers a simplified interface for single-turn interactions.

	Endpoints:
	- `POST /v1/responses` - Generate a response

	Limitations:
	- ❌ `previous_response_id` is not supported - conversation history must be managed client-side
	- ✅ Basic response generation is fully functional
	- ✅ Streaming responses are supported

	Example Request:
	```go
	import (
	"context"
	"fmt"

	"github.com/openai/openai-go/v3"
	"github.com/openai/openai-go/v3/option"
	"github.com/openai/openai-go/v3/responses"
	"github.com/openai/openai-go/v3/shared"
	)

	// Create OpenAI client with AxonHub configuration
	client := openai.NewClient(
	option.WithAPIKey("your-axonhub-api-key"),
	option.WithBaseURL("http://localhost:8090/v1"),
	)

	ctx := context.Background()

	// Generate a response (previous_response_id not supported)
	params := responses.ResponseNewParams{
	Model: shared.ResponsesModel("gpt-4o"),
	Input: responses.ResponseNewParamsInputUnion{
	OfString: openai.String("Hello, how are you?"),
	},
	}

	response, err := client.Responses.New(ctx, params,
	option.WithHeader("AH-Trace-Id", "trace-example-123"),
	option.WithHeader("AH-Thread-Id", "thread-example-abc"))
	if err != nil {
	panic(err)
	}

	fmt.Println(response.OutputText())
	```

	Example: Streaming Response
	```go
	import (
	"context"
	"fmt"
	"strings"

	"github.com/openai/openai-go/v3"
	"github.com/openai/openai-go/v3/option"
	"github.com/openai/openai-go/v3/responses"
	"github.com/openai/openai-go/v3/shared"
	)

	client := openai.NewClient(
	option.WithAPIKey("your-axonhub-api-key"),
	option.WithBaseURL("http://localhost:8090/v1"),
	)

	ctx := context.Background()

	params := responses.ResponseNewParams{
	Model: shared.ResponsesModel("gpt-4o"),
	Input: responses.ResponseNewParamsInputUnion{
	OfString: openai.String("Tell me a short story about a robot."),
	},
	}

	stream := client.Responses.NewStreaming(ctx, params,
	option.WithHeader("AH-Trace-Id", "trace-example-123"),
	option.WithHeader("AH-Thread-Id", "thread-example-abc"))

	var fullContent strings.Builder
	for stream.Next() {
	event := stream.Current()
	if event.Type == "response.output_text.delta" && event.Delta != "" {
	fullContent.WriteString(event.Delta)
	fmt.Print(event.Delta) // Print as it streams
	}
	}

	if err := stream.Err(); err != nil {
	panic(err)
	}

	fmt.Println("\nComplete response:", fullContent.String())
	```

	## API Translation Capabilities

	AxonHub automatically translates between API formats, enabling powerful scenarios:

	### Use OpenAI SDK with Anthropic Models
	```go
	// OpenAI SDK calling Anthropic model
	completion, err := client.Chat.Completions.New(ctx, openai.ChatCompletionNewParams{
	Messages: []openai.ChatCompletionMessageParamUnion{
	openai.UserMessage("Tell me about artificial intelligence"),
	},
	Model: openai.ChatModel("claude-3-5-sonnet"), // Anthropic model
	})

	// Access response
	responseText := completion.Choices[0].Message.Content
	fmt.Println(responseText)
	// AxonHub automatically translates OpenAI format → Anthropic format
	```

	### Use OpenAI SDK with Gemini Models
	```go
	// OpenAI SDK calling Gemini model
	completion, err := client.Chat.Completions.New(ctx, openai.ChatCompletionNewParams{
	Messages: []openai.ChatCompletionMessageParamUnion{
	openai.UserMessage("Explain neural networks"),
	},
	Model: openai.ChatModel("gemini-2.5"), // Gemini model
	})

	// Access response
	responseText := completion.Choices[0].Message.Content
	fmt.Println(responseText)
	// AxonHub automatically translates OpenAI format → Gemini format
	```

	## Embedding API

	AxonHub provides comprehensive support for text and multimodal embedding generation through OpenAI-compatible API.

	Endpoints:
	- `POST /v1/embeddings` - OpenAI-compatible embedding API

	Supported Input Types:
	- Single text string
	- Array of text strings
	- Token arrays (integers)
	- Multiple token arrays

	Supported Encoding Formats:
	- `float` - Default, returns embedding vectors as float arrays
	- `base64` - Returns embeddings as base64-encoded strings

	### Request Format

	```json
	{
	"input": "The text to embed",
	"model": "text-embedding-3-small",
	"encoding_format": "float",
	"dimensions": 1536,
	"user": "user-id"
	}
	```

	Parameters:

	\| Parameter \| Type \| Required \| Description \|
	\|-----------\|------\|----------\|-------------\|
	\| `input` \| string \\| string[] \\| number[] \\| number[][] \| ✅ \| The text(s) to embed. Can be a single string, array of strings, token array, or multiple token arrays. \|
	\| `model` \| string \| ✅ \| The model to use for embedding generation. \|
	\| `encoding_format` \| string \| ❌ \| Format to return embeddings in. Either `float` or `base64`. Default: `float`. \|
	\| `dimensions` \| integer \| ❌ \| Number of dimensions for the output embeddings. \|
	\| `user` \| string \| ❌ \| Unique identifier for the end-user. \|

	### Response Format

	```json
	{
	"object": "list",
	"data": [
	{
	"object": "embedding",
	"embedding": [0.123, 0.456, ...],
	"index": 0
	}
	],
	"model": "text-embedding-3-small",
	"usage": {
	"prompt_tokens": 4,
	"total_tokens": 4
	}
	}
	```

	### Examples

	OpenAI SDK (Python):
	```python
	import openai

	client = openai.OpenAI(
	api_key="your-axonhub-api-key",
	base_url="http://localhost:8090/v1"
	)

	response = client.embeddings.create(
	input="Hello, world!",
	model="text-embedding-3-small"
	)

	print(response.data[0].embedding[:5]) # First 5 dimensions
	```

	OpenAI SDK (Go):
	```go
	package main

	import (
	"context"
	"fmt"
	"log"

	"github.com/openai/openai-go"
	"github.com/openai/openai-go/option"
	)

	func main() {
	client := openai.NewClient(
	option.WithAPIKey("your-axonhub-api-key"),
	option.WithBaseURL("http://localhost:8090/v1"),
	)

	embedding, err := client.Embeddings.New(context.TODO(), openai.EmbeddingNewParams{
	Input: openai.Union[string](openai.String("Hello, world!")),
	Model: openai.String("text-embedding-3-small"),
	option.WithHeader("AH-Trace-Id", "trace-example-123"),
	option.WithHeader("AH-Thread-Id", "thread-example-abc"),
	})
	if err != nil {
	log.Fatal(err)
	}

	fmt.Printf("Embedding dimensions: %d\n", len(embedding.Data[0].Embedding))
	fmt.Printf("First 5 values: %v\n", embedding.Data[0].Embedding[:5])
	}
	```

	Multiple Texts:
	```python
	response = client.embeddings.create(
	input=["Hello, world!", "How are you?"],
	model="text-embedding-3-small"
	)

	for i, data in enumerate(response.data):
	print(f"Text {i}: {data.embedding[:3]}...")
	```

	## Models API

	AxonHub provides an enhanced `/v1/models` endpoint that lists available models with optional extended metadata.

	### Supported Endpoints

	Endpoints:
	- `GET /v1/models` - List available models

	### Query Parameters

	\| Parameter \| Type \| Required \| Description \|
	\|-----------\|------\|----------\|-------------\|
	\| `include` \| string \| ❌ \| Comma-separated list of fields to include, or "all" for all extended fields \|

	### Available Fields for Include

	\| Field \| Type \| Description \|
	\|-------\|------\|-------------\|
	\| `name` \| string \| Display name of the model \|
	\| `description` \| string \| Model description \|
	\| `context_length` \| integer \| Maximum context length in tokens \|
	\| `max_output_tokens` \| integer \| Maximum output tokens \|
	\| `capabilities` \| object \| Model capabilities (vision, tool_call, reasoning) \|
	\| `pricing` \| object \| Pricing information (input, output, cache_read, cache_write) \|
	\| `icon` \| string \| Model icon URL \|
	\| `type` \| string \| Model type (chat, embedding, image, rerank, moderation, tts, stt) \|

	### Response Format (Basic - Default)

	When called without the `include` parameter, the endpoint returns only basic fields:

	```json
	{
	"object": "list",
	"data": [
	{
	"id": "gpt-4",
	"object": "model",
	"created": 1686935002,
	"owned_by": "openai"
	}
	]
	}
	```

	Fields:
	- `id` - Model identifier
	- `object` - Always "model"
	- `created` - Unix timestamp of model creation
	- `owned_by` - Organization that owns the model

	### Response Format (Extended)

	When using `?include=all` or selective fields, the response includes extended metadata:

	```json
	{
	"object": "list",
	"data": [
	{
	"id": "gpt-4",
	"object": "model",
	"created": 1686935002,
	"owned_by": "openai",
	"name": "GPT-4",
	"description": "GPT-4 model with advanced reasoning capabilities",
	"context_length": 8192,
	"max_output_tokens": 4096,
	"capabilities": {
	"vision": false,
	"tool_call": true,
	"reasoning": true
	},
	"pricing": {
	"input": 30.0,
	"output": 60.0,
	"cache_read": 15.0,
	"cache_write": 30.0,
	"unit": "per_1m_tokens",
	"currency": "USD"
	},
	"icon": "https://example.com/icon.png",
	"type": "chat"
	}
	]
	}
	```

	Extended Fields:
	- `name` - Human-readable model name
	- `description` - Detailed model description
	- `context_length` - Maximum tokens in context window
	- `max_output_tokens` - Maximum tokens in response
	- `capabilities` - Object with boolean flags:
	- `vision` - Supports image inputs
	- `tool_call` - Supports function calling
	- `reasoning` - Supports advanced reasoning
	- `pricing` - Object with pricing details:
	- `input` - Input token price per 1M tokens
	- `output` - Output token price per 1M tokens
	- `cache_read` - Cache read price per 1M tokens
	- `cache_write` - Cache write price per 1M tokens
	- `unit` - Always "per_1m_tokens"
	- `currency` - Always "USD"
	- `icon` - URL to model icon image
	- `type` - Model category (chat, embedding, image, rerank, moderation, tts, stt)

	### Examples

	Basic Request (Default):
	```bash
	curl -s http://localhost:8090/v1/models \
	-H "Authorization: Bearer your-api-key" \| jq
	```

	Include All Extended Fields:
	```bash
	curl -s "http://localhost:8090/v1/models?include=all" \
	-H "Authorization: Bearer your-api-key" \| jq
	```

	Selective Fields Only:
	```bash
	curl -s "http://localhost:8090/v1/models?include=name,pricing" \
	-H "Authorization: Bearer your-api-key" \| jq
	```

	OpenAI SDK (Python):
	```python
	import openai

	client = openai.OpenAI(
	api_key="your-axonhub-api-key",
	base_url="http://localhost:8090/v1"
	)

	# Get models with extended metadata
	models = client.models.list()
	for model in models.data:
	print(f"Model: {model.id}")
	# Access extended fields if available
	if hasattr(model, 'name'):
	print(f" Name: {model.name}")
	if hasattr(model, 'pricing'):
	print(f" Input price: ${model.pricing.input}/1M tokens")
	```

	### Error Responses

	401 Unauthorized - Invalid API Key:
	```json
	{
	"error": {
	"message": "Invalid API key",
	"type": "invalid_request_error",
	"code": "invalid_api_key"
	}
	}
	```

	500 Internal Server Error:
	```json
	{
	"error": {
	"message": "Internal server error",
	"type": "internal_error",
	"code": "internal_error"
	}
	}
	```

	### Field Availability Note

	> Note: Extended fields are only populated if the model has ModelCard data configured in the database. Models without ModelCard data will return `null` for extended fields.

	## Authentication

	The OpenAI API format uses Bearer token authentication:

	- Header: `Authorization: Bearer <your-api-key>`

	The API keys are managed through AxonHub's API Key management system.

	## Streaming Support

	OpenAI API format supports streaming responses:

	```go
	// OpenAI SDK streaming
	completion, err := client.Chat.Completions.New(ctx, openai.ChatCompletionNewParams{
	Messages: []openai.ChatCompletionMessageParamUnion{
	openai.UserMessage("Write a short story about AI"),
	},
	Model: openai.ChatModel("claude-3-5-sonnet"),
	Stream: openai.Bool(true),
	})
	if err != nil {
	panic(err)
	}

	// Iterate over streaming chunks
	for completion.Next() {
	chunk := completion.Current()
	if len(chunk.Choices) > 0 && chunk.Choices[0].Delta.Content != "" {
	fmt.Print(chunk.Choices[0].Delta.Content)
	}
	}

	if err := completion.Err(); err != nil {
	panic(err)
	}
	```

	## Error Handling

	OpenAI format error responses:

	```json
	{
	"error": {
	"message": "Invalid API key",
	"type": "invalid_request_error",
	"code": "invalid_api_key"
	}
	}
	```

	## Tool Support

	AxonHub supports function tools (custom function calling) through the OpenAI API format. However, provider-specific tools are not supported:

	\| Tool Type \| Support Status \| Notes \|
	\| --------- \| -------------- \| ----- \|
	\| Function Tools \| ✅ Supported \| Custom function definitions work across all providers \|
	\| Web Search \| ❌ Not Supported \| Provider-specific (OpenAI, Anthropic, etc.) \|
	\| Code Interpreter \| ❌ Not Supported \| Provider-specific (OpenAI, Anthropic, etc.) \|
	\| File Search \| ❌ Not Supported \| Provider-specific \|
	\| Computer Use \| ❌ Not Supported \| Anthropic-specific \|

	> Note: Only generic function tools that can be translated across providers are supported. Provider-specific tools like web search, code interpreter, and computer use require direct access to the provider's infrastructure and cannot be proxied through AxonHub.

	## Best Practices

	1. Use Tracing Headers: Include `AH-Trace-Id` and `AH-Thread-Id` headers for better observability
	2. Model Selection: Specify the target model explicitly in your requests
	3. Error Handling: Implement proper error handling for API responses
	4. Streaming: Use streaming for better user experience with long responses
	5. Use Function Tools: For tool calling, use generic function tools instead of provider-specific tools

	## Migration Guide

	### From OpenAI to AxonHub
	```go
	// Before: Direct OpenAI
	client := openai.NewClient(
	option.WithAPIKey("openai-key"),
	)

	// After: AxonHub with OpenAI API
	client := openai.NewClient(
	option.WithAPIKey("axonhub-api-key"),
	option.WithBaseURL("http://localhost:8090/v1"),
	)
	// Your existing code continues to work!
	```