OpenAI API Reference

Overview

AxonHub provides full support for the OpenAI API specification, allowing you to use any OpenAI-compatible client SDK to access models from multiple providers.

Key Benefits

  • API Interoperability: Use OpenAI Chat Completions API to call Anthropic, Gemini, and other supported models
  • Zero Code Changes: Continue using your existing OpenAI client SDK without modification
  • Automatic Translation: AxonHub automatically converts between API formats when needed
  • Provider Flexibility: Access any supported AI provider using the OpenAI API format

Supported Endpoints

OpenAI Chat Completions API

Endpoints:

  • POST /v1/chat/completions - Text generation
  • GET /v1/models - List available models

Example Request:

import (
    "context"
    "fmt"

    "github.com/openai/openai-go/v3"
    "github.com/openai/openai-go/v3/option"
)

// Create OpenAI client with AxonHub configuration
client := openai.NewClient(
    option.WithAPIKey("your-axonhub-api-key"),
    option.WithBaseURL("http://localhost:8090/v1"),
)

// Call Anthropic model using OpenAI API format
completion, err := client.Chat.Completions.New(ctx,
    openai.ChatCompletionNewParams{
        Messages: []openai.ChatCompletionMessageParamUnion{
            openai.UserMessage("Hello, Claude!"),
        },
        Model: openai.ChatModel("claude-3-5-sonnet"),
    },
    option.WithHeader("AH-Trace-Id", "trace-example-123"),
    option.WithHeader("AH-Thread-Id", "thread-example-abc"))
if err != nil {
    // Handle error appropriately
    panic(err)
}

// Access the response content
responseText := completion.Choices[0].Message.Content
fmt.Println(responseText)

OpenAI Responses API

AxonHub provides partial support for the OpenAI Responses API. This API offers a simplified interface for single-turn interactions.

Endpoints:

  • POST /v1/responses - Generate a response

Support Status:

  • ✅ Basic response generation is fully supported
  • ✅ Streaming responses are supported
  • ❌ previous_response_id is not supported - conversation history must be managed client-side
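Because previous_response_id is unsupported, each request must carry the full conversation so far. A minimal client-side sketch of one way to do this (the "role: text" flattening below is an illustrative convention, not an AxonHub requirement):

```python
# Maintain conversation history on the client and flatten it into the
# string input of each /v1/responses request. The "role: text" format
# is an illustrative convention, not an AxonHub requirement.

def build_input(history: list[tuple[str, str]], new_user_msg: str) -> str:
    """Flatten prior (role, text) turns plus a new user message."""
    turns = [f"{role}: {text}" for role, text in history]
    turns.append(f"user: {new_user_msg}")
    return "\n".join(turns)

history = [
    ("user", "Hello"),
    ("assistant", "Hi! How can I help?"),
]
prompt = build_input(history, "Tell me a joke.")
print(prompt)
```

After each response, append the model's output to `history` before the next turn.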

Example Request:

import (
    "context"
    "fmt"

    "github.com/openai/openai-go/v3"
    "github.com/openai/openai-go/v3/option"
    "github.com/openai/openai-go/v3/responses"
    "github.com/openai/openai-go/v3/shared"
)

// Create OpenAI client with AxonHub configuration
client := openai.NewClient(
    option.WithAPIKey("your-axonhub-api-key"),
    option.WithBaseURL("http://localhost:8090/v1"),
)

ctx := context.Background()

// Generate a response (previous_response_id not supported)
params := responses.ResponseNewParams{
    Model: shared.ResponsesModel("gpt-4o"),
    Input: responses.ResponseNewParamsInputUnion{
        OfString: openai.String("Hello, how are you?"),
    },
}

response, err := client.Responses.New(ctx, params,
        option.WithHeader("AH-Trace-Id", "trace-example-123"),
        option.WithHeader("AH-Thread-Id", "thread-example-abc"))
if err != nil {
    panic(err)
}

fmt.Println(response.OutputText())

Example: Streaming Response

import (
    "context"
    "fmt"
    "strings"

    "github.com/openai/openai-go/v3"
    "github.com/openai/openai-go/v3/option"
    "github.com/openai/openai-go/v3/responses"
    "github.com/openai/openai-go/v3/shared"
)

client := openai.NewClient(
    option.WithAPIKey("your-axonhub-api-key"),
    option.WithBaseURL("http://localhost:8090/v1"),
)

ctx := context.Background()

params := responses.ResponseNewParams{
    Model: shared.ResponsesModel("gpt-4o"),
    Input: responses.ResponseNewParamsInputUnion{
        OfString: openai.String("Tell me a short story about a robot."),
    },
}

stream := client.Responses.NewStreaming(ctx, params,
        option.WithHeader("AH-Trace-Id", "trace-example-123"),
        option.WithHeader("AH-Thread-Id", "thread-example-abc"))

var fullContent strings.Builder
for stream.Next() {
    event := stream.Current()
    if event.Type == "response.output_text.delta" && event.Delta != "" {
        fullContent.WriteString(event.Delta)
        fmt.Print(event.Delta) // Print as it streams
    }
}

if err := stream.Err(); err != nil {
    panic(err)
}

fmt.Println("\nComplete response:", fullContent.String())

API Translation Capabilities

AxonHub automatically translates between API formats, enabling powerful scenarios:

Use OpenAI SDK with Anthropic Models

// OpenAI SDK calling Anthropic model
completion, err := client.Chat.Completions.New(ctx, openai.ChatCompletionNewParams{
    Messages: []openai.ChatCompletionMessageParamUnion{
        openai.UserMessage("Tell me about artificial intelligence"),
    },
    Model: openai.ChatModel("claude-3-5-sonnet"),  // Anthropic model
})
if err != nil {
    panic(err)
}

// Access response
responseText := completion.Choices[0].Message.Content
fmt.Println(responseText)
// AxonHub automatically translates OpenAI format → Anthropic format

Use OpenAI SDK with Gemini Models

// OpenAI SDK calling Gemini model
completion, err := client.Chat.Completions.New(ctx, openai.ChatCompletionNewParams{
    Messages: []openai.ChatCompletionMessageParamUnion{
        openai.UserMessage("Explain neural networks"),
    },
    Model: openai.ChatModel("gemini-2.5"),  // Gemini model
})
if err != nil {
    panic(err)
}

// Access response
responseText := completion.Choices[0].Message.Content
fmt.Println(responseText)
// AxonHub automatically translates OpenAI format → Gemini format

Embedding API

AxonHub provides comprehensive support for text and multimodal embedding generation through an OpenAI-compatible API.

Endpoints:

  • POST /v1/embeddings - OpenAI-compatible embedding API

Supported Input Types:

  • Single text string
  • Array of text strings
  • Token arrays (integers)
  • Multiple token arrays

Supported Encoding Formats:

  • float - Default, returns embedding vectors as float arrays
  • base64 - Returns embeddings as base64-encoded strings
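When requesting encoding_format: "base64", the returned string is a packed binary buffer rather than a JSON array. A sketch of decoding it, assuming the little-endian float32 layout OpenAI uses for base64 embeddings:

```python
import base64
import struct

def decode_base64_embedding(b64: str) -> list[float]:
    """Decode a base64 embedding string into a list of floats.

    Assumes the buffer is a packed array of little-endian float32
    values, the layout OpenAI uses for encoding_format="base64".
    """
    raw = base64.b64decode(b64)
    count = len(raw) // 4  # 4 bytes per float32
    return list(struct.unpack(f"<{count}f", raw))

# Round trip a known vector to show the layout
vec = [0.123, 0.456, -0.789]
encoded = base64.b64encode(struct.pack(f"<{len(vec)}f", *vec)).decode()
decoded = decode_base64_embedding(encoded)
print(decoded[:3])
```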

Request Format

{
  "input": "The text to embed",
  "model": "text-embedding-3-small",
  "encoding_format": "float",
  "dimensions": 1536,
  "user": "user-id"
}

Parameters:

| Parameter | Type | Required | Description |
|-----------|------|----------|-------------|
| input | string \| string[] \| number[] \| number[][] | Yes | The text(s) to embed. Can be a single string, an array of strings, a token array, or multiple token arrays. |
| model | string | Yes | The model to use for embedding generation. |
| encoding_format | string | No | Format to return embeddings in. Either float or base64. Default: float. |
| dimensions | integer | No | Number of dimensions for the output embeddings. |
| user | string | No | Unique identifier for the end user. |

Response Format

{
  "object": "list",
  "data": [
    {
      "object": "embedding",
      "embedding": [0.123, 0.456, ...],
      "index": 0
    }
  ],
  "model": "text-embedding-3-small",
  "usage": {
    "prompt_tokens": 4,
    "total_tokens": 4
  }
}
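The vectors in data[].embedding are typically consumed for similarity search or clustering. A self-contained cosine-similarity sketch for comparing two embedding vectors:

```python
import math

def cosine_similarity(a: list[float], b: list[float]) -> float:
    """Cosine similarity between two embedding vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(x * x for x in b))
    return dot / (norm_a * norm_b)

# Parallel vectors score ~1.0; orthogonal vectors score 0.0
print(cosine_similarity([1.0, 2.0], [2.0, 4.0]))
print(cosine_similarity([1.0, 0.0], [0.0, 1.0]))
```

In practice you would pass two `response.data[i].embedding` arrays; scores closer to 1.0 indicate more semantically similar texts.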

Examples

OpenAI SDK (Python):

import openai

client = openai.OpenAI(
    api_key="your-axonhub-api-key",
    base_url="http://localhost:8090/v1"
)

response = client.embeddings.create(
    input="Hello, world!",
    model="text-embedding-3-small"
)

print(response.data[0].embedding[:5])  # First 5 dimensions

OpenAI SDK (Go):

package main

import (
    "context"
    "fmt"
    "log"

    "github.com/openai/openai-go/v3"
    "github.com/openai/openai-go/v3/option"
)

func main() {
    client := openai.NewClient(
        option.WithAPIKey("your-axonhub-api-key"),
        option.WithBaseURL("http://localhost:8090/v1"),
    )

    embedding, err := client.Embeddings.New(context.TODO(),
        openai.EmbeddingNewParams{
            Input: openai.EmbeddingNewParamsInputUnion{
                OfString: openai.String("Hello, world!"),
            },
            Model: openai.EmbeddingModel("text-embedding-3-small"),
        },
        option.WithHeader("AH-Trace-Id", "trace-example-123"),
        option.WithHeader("AH-Thread-Id", "thread-example-abc"),
    )
    if err != nil {
        log.Fatal(err)
    }

    fmt.Printf("Embedding dimensions: %d\n", len(embedding.Data[0].Embedding))
    fmt.Printf("First 5 values: %v\n", embedding.Data[0].Embedding[:5])
}

Multiple Texts:

response = client.embeddings.create(
    input=["Hello, world!", "How are you?"],
    model="text-embedding-3-small"
)

for i, data in enumerate(response.data):
    print(f"Text {i}: {data.embedding[:3]}...")

Models API

AxonHub provides an enhanced /v1/models endpoint that lists available models with optional extended metadata.

Supported Endpoints

Endpoints:

  • GET /v1/models - List available models

Query Parameters

| Parameter | Type | Required | Description |
|-----------|------|----------|-------------|
| include | string | No | Comma-separated list of fields to include, or "all" for all extended fields. |

Available Fields for Include

| Field | Type | Description |
|-------|------|-------------|
| name | string | Display name of the model |
| description | string | Model description |
| context_length | integer | Maximum context length in tokens |
| max_output_tokens | integer | Maximum output tokens |
| capabilities | object | Model capabilities (vision, tool_call, reasoning) |
| pricing | object | Pricing information (input, output, cache_read, cache_write) |
| icon | string | Model icon URL |
| type | string | Model type (chat, embedding, image, rerank, moderation, tts, stt) |

Response Format (Basic - Default)

When called without the include parameter, the endpoint returns only basic fields:

{
  "object": "list",
  "data": [
    {
      "id": "gpt-4",
      "object": "model",
      "created": 1686935002,
      "owned_by": "openai"
    }
  ]
}

Fields:

  • id - Model identifier
  • object - Always "model"
  • created - Unix timestamp of model creation
  • owned_by - Organization that owns the model

Response Format (Extended)

When using ?include=all or selective fields, the response includes extended metadata:

{
  "object": "list",
  "data": [
    {
      "id": "gpt-4",
      "object": "model",
      "created": 1686935002,
      "owned_by": "openai",
      "name": "GPT-4",
      "description": "GPT-4 model with advanced reasoning capabilities",
      "context_length": 8192,
      "max_output_tokens": 4096,
      "capabilities": {
        "vision": false,
        "tool_call": true,
        "reasoning": true
      },
      "pricing": {
        "input": 30.0,
        "output": 60.0,
        "cache_read": 15.0,
        "cache_write": 30.0,
        "unit": "per_1m_tokens",
        "currency": "USD"
      },
      "icon": "https://example.com/icon.png",
      "type": "chat"
    }
  ]
}

Extended Fields:

  • name - Human-readable model name
  • description - Detailed model description
  • context_length - Maximum tokens in context window
  • max_output_tokens - Maximum tokens in response
  • capabilities - Object with boolean flags:
    • vision - Supports image inputs
    • tool_call - Supports function calling
    • reasoning - Supports advanced reasoning
  • pricing - Object with pricing details:
    • input - Input token price per 1M tokens
    • output - Output token price per 1M tokens
    • cache_read - Cache read price per 1M tokens
    • cache_write - Cache write price per 1M tokens
    • unit - Always "per_1m_tokens"
    • currency - Always "USD"
  • icon - URL to model icon image
  • type - Model category (chat, embedding, image, rerank, moderation, tts, stt)
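Client code can use the extended fields to select a model at runtime. A sketch that filters a payload of this shape by capability flag (the field names match the documentation above; the model entries are illustrative, and missing capability data is treated as unsupported, consistent with the ModelCard note below):

```python
import json

# Sample extended /v1/models payload (shape matches the documented
# response; the entries themselves are illustrative)
payload = json.loads("""
{
  "object": "list",
  "data": [
    {"id": "gpt-4", "object": "model",
     "capabilities": {"vision": false, "tool_call": true, "reasoning": true}},
    {"id": "text-embedding-3-small", "object": "model", "type": "embedding"}
  ]
}
""")

def models_with_capability(payload: dict, cap: str) -> list[str]:
    """IDs of models whose capabilities flag `cap` is true.

    Models without ModelCard data may lack capabilities entirely,
    so a missing field counts as unsupported.
    """
    return [
        m["id"]
        for m in payload["data"]
        if (m.get("capabilities") or {}).get(cap)
    ]

print(models_with_capability(payload, "tool_call"))  # → ['gpt-4']
```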

Examples

Basic Request (Default):

curl -s http://localhost:8090/v1/models \
  -H "Authorization: Bearer your-api-key" | jq

Include All Extended Fields:

curl -s "http://localhost:8090/v1/models?include=all" \
  -H "Authorization: Bearer your-api-key" | jq

Selective Fields Only:

curl -s "http://localhost:8090/v1/models?include=name,pricing" \
  -H "Authorization: Bearer your-api-key" | jq

OpenAI SDK (Python):

import openai

client = openai.OpenAI(
    api_key="your-axonhub-api-key",
    base_url="http://localhost:8090/v1"
)

# Get models with extended metadata
models = client.models.list()
for model in models.data:
    print(f"Model: {model.id}")
    # Access extended fields if available
    if hasattr(model, 'name'):
        print(f"  Name: {model.name}")
    if hasattr(model, 'pricing') and model.pricing:
        pricing = model.pricing  # extra fields arrive as plain dicts
        print(f"  Input price: ${pricing['input']}/1M tokens")

Error Responses

401 Unauthorized - Invalid API Key:

{
  "error": {
    "message": "Invalid API key",
    "type": "invalid_request_error",
    "code": "invalid_api_key"
  }
}

500 Internal Server Error:

{
  "error": {
    "message": "Internal server error",
    "type": "internal_error",
    "code": "internal_error"
  }
}

Field Availability Note

Note: Extended fields are only populated if the model has ModelCard data configured in the database. Models without ModelCard data will return null for extended fields.

Authentication

The OpenAI API format uses Bearer token authentication:

  • Header: Authorization: Bearer <your-api-key>

API keys are managed through AxonHub's API Key management system.

Streaming Support

OpenAI API format supports streaming responses:

// OpenAI SDK streaming
stream := client.Chat.Completions.NewStreaming(ctx, openai.ChatCompletionNewParams{
    Messages: []openai.ChatCompletionMessageParamUnion{
        openai.UserMessage("Write a short story about AI"),
    },
    Model: openai.ChatModel("claude-3-5-sonnet"),
})

// Iterate over streaming chunks
for stream.Next() {
    chunk := stream.Current()
    if len(chunk.Choices) > 0 && chunk.Choices[0].Delta.Content != "" {
        fmt.Print(chunk.Choices[0].Delta.Content)
    }
}

if err := stream.Err(); err != nil {
    panic(err)
}
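On the wire, streaming responses arrive as server-sent events carrying chunks in the standard OpenAI streaming format. A minimal sketch of extracting content deltas from individual data lines (chunk shape per the OpenAI streaming format; the SDK normally does this for you):

```python
import json

def parse_sse_delta(line: str):
    """Return the content delta from one SSE data line, or None.

    None is returned for non-data lines, the [DONE] sentinel, and
    chunks without a content delta (e.g. role-only or final chunks).
    """
    if not line.startswith("data: "):
        return None
    payload = line[len("data: "):].strip()
    if payload == "[DONE]":
        return None
    chunk = json.loads(payload)
    choices = chunk.get("choices", [])
    if not choices:
        return None
    return choices[0].get("delta", {}).get("content")

line = 'data: {"choices": [{"delta": {"content": "Hello"}}]}'
print(parse_sse_delta(line))            # → Hello
print(parse_sse_delta("data: [DONE]"))  # → None
```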

Error Handling

OpenAI format error responses:

{
  "error": {
    "message": "Invalid API key",
    "type": "invalid_request_error",
    "code": "invalid_api_key"
  }
}
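Because error bodies share this shape across endpoints, a single helper can branch on the machine-readable code field. A sketch (field names per the error format above):

```python
import json

def parse_api_error(body: str) -> tuple[str, str, str]:
    """Extract (type, code, message) from an OpenAI-format error body."""
    err = json.loads(body).get("error", {})
    return err.get("type", ""), err.get("code", ""), err.get("message", "")

body = ('{"error": {"message": "Invalid API key", '
        '"type": "invalid_request_error", "code": "invalid_api_key"}}')
etype, code, message = parse_api_error(body)
if code == "invalid_api_key":
    # Authentication errors are not transient; retrying will not help
    print("check your AxonHub API key")
```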

Tool Support

AxonHub supports function tools (custom function calling) through the OpenAI API format. However, provider-specific tools are not supported:

| Tool Type | Support Status | Notes |
|-----------|----------------|-------|
| Function Tools | ✅ Supported | Custom function definitions work across all providers |
| Web Search | ❌ Not Supported | Provider-specific (OpenAI, Anthropic, etc.) |
| Code Interpreter | ❌ Not Supported | Provider-specific (OpenAI, Anthropic, etc.) |
| File Search | ❌ Not Supported | Provider-specific |
| Computer Use | ❌ Not Supported | Anthropic-specific |

Note: Only generic function tools that can be translated across providers are supported. Provider-specific tools like web search, code interpreter, and computer use require direct access to the provider's infrastructure and cannot be proxied through AxonHub.

Best Practices

  1. Use Tracing Headers: Include AH-Trace-Id and AH-Thread-Id headers for better observability
  2. Model Selection: Specify the target model explicitly in your requests
  3. Error Handling: Implement proper error handling for API responses
  4. Streaming: Use streaming for better user experience with long responses
  5. Use Function Tools: For tool calling, use generic function tools instead of provider-specific tools

Migration Guide

From OpenAI to AxonHub

// Before: Direct OpenAI
client := openai.NewClient(
    option.WithAPIKey("openai-key"),
)

// After: AxonHub with OpenAI API
client := openai.NewClient(
    option.WithAPIKey("axonhub-api-key"),
    option.WithBaseURL("http://localhost:8090/v1"),
)
// Your existing code continues to work!
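A common migration pattern is to make the base URL and key configurable, so the same code can target OpenAI directly or route through AxonHub without edits. A sketch in Python (the environment variable names are illustrative, not an AxonHub convention):

```python
import os

# Illustrative environment variable names; pick any convention you like.
# With no overrides set, the client talks to OpenAI directly; pointing
# LLM_BASE_URL at AxonHub routes the same code through the gateway.
base_url = os.environ.get("LLM_BASE_URL", "https://api.openai.com/v1")
api_key = os.environ.get("LLM_API_KEY", "")

config = {"base_url": base_url, "api_key": api_key}
print(config["base_url"])
```

Deployments would then set LLM_BASE_URL=http://localhost:8090/v1 and LLM_API_KEY to an AxonHub key to switch backends.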