anti_api / API.md
ZhaoShanGeng
docs: 更新文档和打包配置
eab61c2

API 使用文档

本文档介绍如何使用 Antigravity2API 提供的 OpenAI 兼容 API。

基础配置

所有 API 请求需要在 Header 中携带 API Key:

Authorization: Bearer YOUR_API_KEY

默认服务地址:http://localhost:8045

目录

获取模型列表

curl http://localhost:8045/v1/models \
  -H "Authorization: Bearer sk-text"

说明:模型列表会缓存 1 小时(可通过 config.jsoncache.modelListTTL 配置),减少 API 请求。

聊天补全

流式响应

curl http://localhost:8045/v1/chat/completions \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer sk-text" \
  -d '{
    "model": "gemini-2.0-flash-exp",
    "messages": [{"role": "user", "content": "你好"}],
    "stream": true
  }'

非流式响应

curl http://localhost:8045/v1/chat/completions \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer sk-text" \
  -d '{
    "model": "gemini-2.0-flash-exp",
    "messages": [{"role": "user", "content": "你好"}],
    "stream": false
  }'

工具调用(Function Calling)

curl http://localhost:8045/v1/chat/completions \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer sk-text" \
  -d '{
    "model": "gemini-2.0-flash-exp",
    "messages": [{"role": "user", "content": "北京天气怎么样"}],
    "tools": [{
      "type": "function",
      "function": {
        "name": "get_weather",
        "description": "获取天气信息",
        "parameters": {
          "type": "object",
          "properties": {
            "location": {"type": "string", "description": "城市名称"}
          },
          "required": ["location"]
        }
      }
    }]
  }'

图片输入(多模态)

支持 Base64 编码的图片输入,兼容 OpenAI 的多模态格式:

curl http://localhost:8045/v1/chat/completions \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer sk-text" \
  -d '{
    "model": "gemini-2.0-flash-exp",
    "messages": [{
      "role": "user",
      "content": [
        {"type": "text", "text": "这张图片里有什么?"},
        {
          "type": "image_url",
          "image_url": {
            "url": "..."
          }
        }
      ]
    }],
    "stream": true
  }'

支持的图片格式

  • JPEG/JPG (data:image/jpeg;base64,...)
  • PNG (data:image/png;base64,...)
  • GIF (data:image/gif;base64,...)
  • WebP (data:image/webp;base64,...)

图片生成

支持使用 gemini-3-pro-image 模型生成图片,生成的图片会以 Markdown 格式返回:

curl http://localhost:8045/v1/chat/completions \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer sk-text" \
  -d '{
    "model": "gemini-3-pro-image",
    "messages": [{"role": "user", "content": "画一只可爱的猫"}],
    "stream": false
  }'

响应示例

{
  "choices": [{
    "message": {
      "role": "assistant",
      "content": "![image](http://localhost:8045/images/abc123.png)"
    }
  }]
}

注意

  • 生成的图片会保存到 public/images/ 目录
  • 需要配置 IMAGE_BASE_URL 环境变量以返回正确的图片 URL

请求参数说明

参数 类型 必填 说明
model string 模型名称
messages array 对话消息列表
stream boolean 是否流式响应,默认 false
temperature number 温度参数,默认 1
top_p number Top P 参数,默认 1
top_k number Top K 参数,默认 50
max_tokens number 最大 token 数,默认 32000
thinking_budget number 思考预算(仅对思考模型生效),可为 0 或 1024-32000,默认 1024(0 表示关闭思考预算限制)
reasoning_effort string 思维链强度(OpenAI 格式),可选值:low(1024)、medium(16000)、high(32000)
tools array 工具列表(Function Calling)

响应格式

非流式响应

{
  "id": "chatcmpl-xxx",
  "object": "chat.completion",
  "created": 1234567890,
  "model": "gemini-2.0-flash-exp",
  "choices": [{
    "index": 0,
    "message": {
      "role": "assistant",
      "content": "你好!有什么我可以帮助你的吗?"
    },
    "finish_reason": "stop"
  }],
  "usage": {
    "prompt_tokens": 10,
    "completion_tokens": 20,
    "total_tokens": 30
  }
}

流式响应

data: {"id":"chatcmpl-xxx","object":"chat.completion.chunk","created":1234567890,"model":"gemini-2.0-flash-exp","choices":[{"index":0,"delta":{"role":"assistant","content":"你"},"finish_reason":null}]}

data: {"id":"chatcmpl-xxx","object":"chat.completion.chunk","created":1234567890,"model":"gemini-2.0-flash-exp","choices":[{"index":0,"delta":{"content":"好"},"finish_reason":null}]}

data: [DONE]

错误处理

API 返回标准的 HTTP 状态码:

状态码 说明
200 请求成功
400 请求参数错误
401 API Key 无效
429 请求过于频繁
500 服务器内部错误

错误响应格式:

{
  "error": {
    "message": "错误信息",
    "type": "invalid_request_error",
    "code": "invalid_api_key"
  }
}

思维链模型

对于支持思维链的模型(如 gemini-2.5-proclaude-opus-4-5-thinking 等),可以通过以下参数控制推理深度:

使用 reasoning_effort(OpenAI 兼容格式)

curl http://localhost:8045/v1/chat/completions \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer sk-text" \
  -d '{
    "model": "gemini-2.5-pro",
    "messages": [{"role": "user", "content": "解释量子纠缠"}],
    "stream": true,
    "reasoning_effort": "high"
  }'
reasoning_effort thinking_budget 说明
low 1024 快速响应,适合简单问题(默认)
medium 16000 平衡模式
high 32000 深度思考,适合复杂推理

使用 thinking_budget(直接数值)

curl http://localhost:8045/v1/chat/completions \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer sk-text" \
  -d '{
    "model": "gemini-2.5-pro",
    "messages": [{"role": "user", "content": "证明勾股定理"}],
      "stream": true,
    "thinking_budget": 24000
  }'

429 自动重试配置

所有 429 重试次数仅通过服务端配置控制:

  • 全局默认重试次数(服务端配置):
    • 文件:config.json 中的 other.retryTimes
    • 示例:
      "other": {
        "timeout": 300000,
        "retryTimes": 3,
        "skipProjectIdFetch": false,
        "useNativeAxios": false
      }
      
    • 服务器始终使用这里配置的值作为 429 时的重试次数(默认 3 次)。

思维链响应格式

思维链内容通过 reasoning_content 字段输出(兼容 DeepSeek 格式):

非流式响应

{
  "choices": [{
    "message": {
      "role": "assistant",
      "reasoning_content": "让我思考一下这个问题...",
      "content": "量子纠缠是..."
    }
  }]
}

流式响应

data: {"choices":[{"delta":{"reasoning_content":"让我"}}]}
data: {"choices":[{"delta":{"reasoning_content":"思考..."}}]}
data: {"choices":[{"delta":{"content":"量子纠缠是..."}}]}

支持思维链的模型

  • gemini-2.5-pro
  • gemini-2.5-flash-thinking
  • gemini-3-pro-high
  • gemini-3-pro-low
  • claude-opus-4-5-thinking
  • claude-sonnet-4-5-thinking
  • rev19-uic3-1p
  • gpt-oss-120b-medium

SD WebUI 兼容 API

本服务提供与 Stable Diffusion WebUI 兼容的 API 接口,可用于与支持 SD WebUI API 的客户端集成。

文本生成图片

curl http://localhost:8045/sdapi/v1/txt2img \
  -H "Content-Type: application/json" \
  -d '{
    "prompt": "a cute cat, high quality, detailed",
    "negative_prompt": "",
    "steps": 20,
    "width": 512,
    "height": 512
  }'

图片生成图片

curl http://localhost:8045/sdapi/v1/img2img \
  -H "Content-Type: application/json" \
  -d '{
    "prompt": "enhance this image, high quality",
    "init_images": ["BASE64_ENCODED_IMAGE"],
    "steps": 20
  }'

其他 SD API 端点

端点 说明
GET /sdapi/v1/sd-models 获取可用的图片生成模型
GET /sdapi/v1/options 获取当前选项
GET /sdapi/v1/samplers 获取可用的采样器
GET /sdapi/v1/upscalers 获取可用的放大器
GET /sdapi/v1/progress 获取生成进度

管理 API

管理 API 需要 JWT 认证,先通过登录接口获取 token。

登录

curl http://localhost:8045/admin/login \
  -H "Content-Type: application/json" \
  -d '{
    "username": "admin",
    "password": "admin123"
  }'

Token 管理

# 获取 Token 列表
curl http://localhost:8045/admin/tokens \
  -H "Authorization: Bearer JWT_TOKEN"

# 添加 Token
curl http://localhost:8045/admin/tokens \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer JWT_TOKEN" \
  -d '{
    "access_token": "ya29.xxx",
    "refresh_token": "1//xxx",
    "expires_in": 3599
  }'

# 删除 Token
curl -X DELETE http://localhost:8045/admin/tokens/REFRESH_TOKEN \
  -H "Authorization: Bearer JWT_TOKEN"

查看模型额度

# 获取指定 Token 的模型额度
curl http://localhost:8045/admin/tokens/REFRESH_TOKEN/quotas \
  -H "Authorization: Bearer JWT_TOKEN"

# 强制刷新额度数据
curl "http://localhost:8045/admin/tokens/REFRESH_TOKEN/quotas?refresh=true" \
  -H "Authorization: Bearer JWT_TOKEN"

响应示例

{
  "success": true,
  "data": {
    "lastUpdated": 1702700000000,
    "models": {
      "gemini-2.5-pro": {
        "remaining": 0.85,
        "resetTime": "12-16 20:00",
        "resetTimeRaw": "2024-12-16T12:00:00Z"
      }
    }
  }
}

轮询策略配置

# 获取当前轮询配置
curl http://localhost:8045/admin/rotation \
  -H "Authorization: Bearer JWT_TOKEN"

# 更新轮询策略
curl -X PUT http://localhost:8045/admin/rotation \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer JWT_TOKEN" \
  -d '{
    "strategy": "request_count",
    "requestCount": 20
  }'

可用策略

  • round_robin:每次请求切换 Token
  • quota_exhausted:额度耗尽才切换
  • request_count:自定义请求次数后切换

配置管理

# 获取配置
curl http://localhost:8045/admin/config \
  -H "Authorization: Bearer JWT_TOKEN"

# 更新配置
curl -X PUT http://localhost:8045/admin/config \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer JWT_TOKEN" \
  -d '{
    "json": {
      "defaults": {
        "temperature": 0.7
      }
    }
  }'

使用示例

Python

import openai

openai.api_base = "http://localhost:8045/v1"
openai.api_key = "sk-text"

response = openai.ChatCompletion.create(
    model="gemini-2.0-flash-exp",
    messages=[{"role": "user", "content": "你好"}],
    stream=True
)

for chunk in response:
    print(chunk.choices[0].delta.get("content", ""), end="")

Node.js

import OpenAI from 'openai';

const openai = new OpenAI({
  baseURL: 'http://localhost:8045/v1',
  apiKey: 'sk-text'
});

const stream = await openai.chat.completions.create({
  model: 'gemini-2.0-flash-exp',
  messages: [{ role: 'user', content: '你好' }],
  stream: true
});

for await (const chunk of stream) {
  process.stdout.write(chunk.choices[0]?.delta?.content || '');
}

配置选项

passSignatureToClient

控制是否将 thoughtSignature 透传到客户端响应中。

config.json 中配置:

{
  "other": {
    "passSignatureToClient": false
  }
}
  • false(默认):不透传签名,响应中不包含 thoughtSignature 字段
  • true:透传签名,响应中包含 thoughtSignature 字段

启用透传后的响应示例

{
  "choices": [{
    "delta": {
      "reasoning_content": "让我思考...",
      "thoughtSignature": "RXFRRENrZ0lDaEFD..."
    }
  }]
}

useContextSystemPrompt

控制是否将请求中的 system 消息合并到 SystemInstruction。

{
  "other": {
    "useContextSystemPrompt": false
  }
}
  • false(默认):仅使用全局 SYSTEM_INSTRUCTION 环境变量
  • true:将请求开头连续的 system 消息与全局配置合并

注意事项

  1. 所有 /v1/* 请求必须携带有效的 API Key
  2. 管理 API (/admin/*) 需要 JWT 认证
  3. 图片输入需要使用 Base64 编码
  4. 流式响应使用 Server-Sent Events (SSE) 格式,包含心跳机制防止超时
  5. 工具调用需要模型支持 Function Calling
  6. 图片生成仅支持 gemini-3-pro-image 模型
  7. 模型列表会缓存 1 小时,可通过配置调整
  8. 思维链内容通过 reasoning_content 字段输出(兼容 DeepSeek 格式)
  9. 默认轮询策略为 request_count,每 50 次请求切换 Token