# API Usage Guide

This document describes how to use the OpenAI-compatible API provided by Antigravity2API.

## Basic Configuration

Every API request must carry the API key in a header:

```
Authorization: Bearer YOUR_API_KEY
```

Default service address: `http://localhost:8045`

## Table of Contents

- [List Models](#list-models)
- [Chat Completions](#chat-completions)
- [Tool Calling](#tool-calling-function-calling)
- [Image Input](#image-input-multimodal)
- [Image Generation](#image-generation)
- [Chain-of-Thought Models](#chain-of-thought-models)
- [SD WebUI-Compatible API](#sd-webui-compatible-api)
- [Admin API](#admin-api)
- [Usage Examples](#usage-examples)

## List Models

```bash
curl http://localhost:8045/v1/models \
  -H "Authorization: Bearer sk-text"
```

**Note**: The model list is cached for 1 hour (configurable via `cache.modelListTTL` in `config.json`) to reduce the number of API requests.

## Chat Completions

### Streaming Response

```bash
curl http://localhost:8045/v1/chat/completions \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer sk-text" \
  -d '{
    "model": "gemini-2.0-flash-exp",
    "messages": [{"role": "user", "content": "Hello"}],
    "stream": true
  }'
```

### Non-Streaming Response

```bash
curl http://localhost:8045/v1/chat/completions \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer sk-text" \
  -d '{
    "model": "gemini-2.0-flash-exp",
    "messages": [{"role": "user", "content": "Hello"}],
    "stream": false
  }'
```

## Tool Calling (Function Calling)

```bash
curl http://localhost:8045/v1/chat/completions \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer sk-text" \
  -d '{
    "model": "gemini-2.0-flash-exp",
    "messages": [{"role": "user", "content": "What is the weather like in Beijing?"}],
    "tools": [{
      "type": "function",
      "function": {
        "name": "get_weather",
        "description": "Get weather information",
        "parameters": {
          "type": "object",
          "properties": {
            "location": {"type": "string", "description": "City name"}
          },
          "required": ["location"]
        }
      }
    }]
  }'
```

## Image Input (Multimodal)

Base64-encoded image input is supported, using the same format as OpenAI's multimodal messages:

```bash
curl http://localhost:8045/v1/chat/completions \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer sk-text" \
  -d '{
    "model": "gemini-2.0-flash-exp",
    "messages": [{
      "role": "user",
      "content": [
        {"type": "text", "text": "What is in this image?"},
        {
          "type": "image_url",
          "image_url": {
            "url": "..."
          }
        }
      ]
    }],
    "stream": true
  }'
```

### Supported Image Formats

- JPEG/JPG (`data:image/jpeg;base64,...`)
- PNG (`data:image/png;base64,...`)
- GIF (`data:image/gif;base64,...`)
- WebP (`data:image/webp;base64,...`)

## Image Generation

Images can be generated with the `gemini-3-pro-image` model. The generated image is returned as a Markdown image link:

```bash
curl http://localhost:8045/v1/chat/completions \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer sk-text" \
  -d '{
    "model": "gemini-3-pro-image",
    "messages": [{"role": "user", "content": "Draw a cute cat"}],
    "stream": false
  }'
```

**Example response**:

```json
{
  "choices": [{
    "message": {
      "role": "assistant",
      "content": "![image](http://localhost:8045/images/abc123.png)"
    }
  }]
}
```

**Notes**:

- Generated images are saved to the `public/images/` directory
- The `IMAGE_BASE_URL` environment variable must be set for the response to contain a correct image URL

## Request Parameters

| Parameter | Type | Required | Description |
|------|------|------|------|
| `model` | string | ✅ | Model name |
| `messages` | array | ✅ | List of conversation messages |
| `stream` | boolean | ❌ | Whether to stream the response; default false |
| `temperature` | number | ❌ | Sampling temperature; default 1 |
| `top_p` | number | ❌ | Top P; default 1 |
| `top_k` | number | ❌ | Top K; default 50 |
| `max_tokens` | number | ❌ | Maximum number of tokens; default 32000 |
| `thinking_budget` | number | ❌ | Thinking budget (only applies to thinking models); either 0 or 1024-32000, default 1024 (0 disables the thinking budget limit) |
| `reasoning_effort` | string | ❌ | Reasoning effort (OpenAI format); one of `low` (1024), `medium` (16000), `high` (32000) |
| `tools` | array | ❌ | List of tools (Function Calling) |

## Response Format

### Non-Streaming Response

```json
{
  "id": "chatcmpl-xxx",
  "object": "chat.completion",
  "created": 1234567890,
  "model": "gemini-2.0-flash-exp",
  "choices": [{
    "index": 0,
    "message": {
      "role": "assistant",
      "content": "Hello! How can I help you?"
    },
    "finish_reason": "stop"
  }],
  "usage": {
    "prompt_tokens": 10,
    "completion_tokens": 20,
    "total_tokens": 30
  }
}
```

### Streaming Response

```
data: {"id":"chatcmpl-xxx","object":"chat.completion.chunk","created":1234567890,"model":"gemini-2.0-flash-exp","choices":[{"index":0,"delta":{"role":"assistant","content":"Hel"},"finish_reason":null}]}

data: {"id":"chatcmpl-xxx","object":"chat.completion.chunk","created":1234567890,"model":"gemini-2.0-flash-exp","choices":[{"index":0,"delta":{"content":"lo"},"finish_reason":null}]}

data: [DONE]
```

## Error Handling

The API returns standard HTTP status codes:

| Status code | Description |
|--------|------|
| 200 | Request succeeded |
| 400 | Invalid request parameters |
| 401 | Invalid API key |
| 429 | Too many requests |
| 500 | Internal server error |

Error response format:

```json
{
  "error": {
    "message": "Error message",
    "type": "invalid_request_error",
    "code": "invalid_api_key"
  }
}
```

## Chain-of-Thought Models

For models that support chain-of-thought reasoning (such as `gemini-2.5-pro` and `claude-opus-4-5-thinking`), the following parameters control reasoning depth:

### Using reasoning_effort (OpenAI-Compatible Format)

```bash
curl http://localhost:8045/v1/chat/completions \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer sk-text" \
  -d '{
    "model": "gemini-2.5-pro",
    "messages": [{"role": "user", "content": "Explain quantum entanglement"}],
    "stream": true,
    "reasoning_effort": "high"
  }'
```

| reasoning_effort | thinking_budget | Description |
|-----------------|-----------------|------|
| `low` | 1024 | Fast responses, suited to simple questions (default) |
| `medium` | 16000 | Balanced mode |
| `high` | 32000 | Deep thinking, suited to complex reasoning |

### Using thinking_budget (Direct Value)

```bash
curl http://localhost:8045/v1/chat/completions \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer sk-text" \
  -d '{
    "model": "gemini-2.5-pro",
    "messages": [{"role": "user", "content": "Prove the Pythagorean theorem"}],
    "stream": true,
    "thinking_budget": 24000
  }'
```

### Automatic Retry on 429

The number of retries after a 429 response is controlled only by server-side configuration:

- Global default retry count (server-side configuration):
  - File: `other.retryTimes` in `config.json`
  - Example:
    ```json
    "other": {
      "timeout": 300000,
      "retryTimes": 3,
      "skipProjectIdFetch": false,
      "useNativeAxios": false
    }
    ```
- The server always uses the value configured here as the retry count for 429 responses (default: 3).

### Chain-of-Thought Response Format

Chain-of-thought content is returned in the `reasoning_content` field (compatible with the DeepSeek format):

**Non-streaming response**:

```json
{
  "choices": [{
    "message": {
      "role": "assistant",
      "reasoning_content": "Let me think about this problem...",
      "content": "Quantum entanglement is..."
    }
  }]
}
```

**Streaming response**:

```
data: {"choices":[{"delta":{"reasoning_content":"Let me"}}]}

data: {"choices":[{"delta":{"reasoning_content":" think..."}}]}

data: {"choices":[{"delta":{"content":"Quantum entanglement is..."}}]}
```

### Models That Support Chain of Thought

- `gemini-2.5-pro`
- `gemini-2.5-flash-thinking`
- `gemini-3-pro-high`
- `gemini-3-pro-low`
- `claude-opus-4-5-thinking`
- `claude-sonnet-4-5-thinking`
- `rev19-uic3-1p`
- `gpt-oss-120b-medium`

## SD WebUI-Compatible API

This service exposes endpoints compatible with the Stable Diffusion WebUI API, so it can be integrated with clients that support that API.

### Text to Image

```bash
curl http://localhost:8045/sdapi/v1/txt2img \
  -H "Content-Type: application/json" \
  -d '{
    "prompt": "a cute cat, high quality, detailed",
    "negative_prompt": "",
    "steps": 20,
    "width": 512,
    "height": 512
  }'
```

### Image to Image

```bash
curl http://localhost:8045/sdapi/v1/img2img \
  -H "Content-Type: application/json" \
  -d '{
    "prompt": "enhance this image, high quality",
    "init_images": ["BASE64_ENCODED_IMAGE"],
    "steps": 20
  }'
```

### Other SD API Endpoints

| Endpoint | Description |
|------|------|
| `GET /sdapi/v1/sd-models` | List available image generation models |
| `GET /sdapi/v1/options` | Get current options |
| `GET /sdapi/v1/samplers` | List available samplers |
| `GET /sdapi/v1/upscalers` | List available upscalers |
| `GET /sdapi/v1/progress` | Get generation progress |

## Admin API

The admin API requires JWT authentication. Obtain a token from the login endpoint first.

### Login

```bash
curl http://localhost:8045/admin/login \
  -H "Content-Type: application/json" \
  -d '{
    "username": "admin",
    "password": "admin123"
  }'
```

### Token Management

```bash
# List tokens
curl http://localhost:8045/admin/tokens \
  -H "Authorization: Bearer JWT_TOKEN"

# Add a token
curl http://localhost:8045/admin/tokens \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer JWT_TOKEN" \
  -d '{
    "access_token": "ya29.xxx",
    "refresh_token": "1//xxx",
    "expires_in": 3599
  }'

# Delete a token
curl -X DELETE http://localhost:8045/admin/tokens/REFRESH_TOKEN \
  -H "Authorization: Bearer JWT_TOKEN"
```

### Viewing Model Quotas

```bash
# Get the model quotas for a given token
curl http://localhost:8045/admin/tokens/REFRESH_TOKEN/quotas \
  -H "Authorization: Bearer JWT_TOKEN"

# Force a refresh of the quota data
curl "http://localhost:8045/admin/tokens/REFRESH_TOKEN/quotas?refresh=true" \
  -H "Authorization: Bearer JWT_TOKEN"
```

**Example response**:

```json
{
  "success": true,
  "data": {
    "lastUpdated": 1702700000000,
    "models": {
      "gemini-2.5-pro": {
        "remaining": 0.85,
        "resetTime": "12-16 20:00",
        "resetTimeRaw": "2024-12-16T12:00:00Z"
      }
    }
  }
}
```

### Rotation Strategy Configuration

```bash
# Get the current rotation configuration
curl http://localhost:8045/admin/rotation \
  -H "Authorization: Bearer JWT_TOKEN"

# Update the rotation strategy
curl -X PUT http://localhost:8045/admin/rotation \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer JWT_TOKEN" \
  -d '{
    "strategy": "request_count",
    "requestCount": 20
  }'
```

**Available strategies**:

- `round_robin`: switch tokens on every request
- `quota_exhausted`: switch only when the quota is exhausted
- `request_count`: switch after a configurable number of requests

### Configuration Management

```bash
# Get the configuration
curl http://localhost:8045/admin/config \
  -H "Authorization: Bearer JWT_TOKEN"

# Update the configuration
curl -X PUT http://localhost:8045/admin/config \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer JWT_TOKEN" \
  -d '{
    "json": {
      "defaults": {
        "temperature": 0.7
      }
    }
  }'
```

## Usage Examples

### Python

```python
from openai import OpenAI

# Requires openai>=1.0; the legacy openai.ChatCompletion interface was removed in 1.0.
client = OpenAI(base_url="http://localhost:8045/v1", api_key="sk-text")

stream = client.chat.completions.create(
    model="gemini-2.0-flash-exp",
    messages=[{"role": "user", "content": "Hello"}],
    stream=True,
)

for chunk in stream:
    # Skip chunks that carry no choices or no text in the delta.
    if chunk.choices and chunk.choices[0].delta.content:
        print(chunk.choices[0].delta.content, end="")
```

### Node.js

```javascript
import OpenAI from 'openai';

const openai = new OpenAI({
  baseURL: 'http://localhost:8045/v1',
  apiKey: 'sk-text'
});

const stream = await openai.chat.completions.create({
  model: 'gemini-2.0-flash-exp',
  messages: [{ role: 'user', content: 'Hello' }],
  stream: true
});

for await (const chunk of stream) {
  process.stdout.write(chunk.choices[0]?.delta?.content || '');
}
```

## Configuration Options

### passSignatureToClient

Controls whether `thoughtSignature` is passed through to the client in responses.

Set it in `config.json`:

```json
{
  "other": {
    "passSignatureToClient": false
  }
}
```

- `false` (default): the signature is not passed through, and responses do not contain a `thoughtSignature` field
- `true`: the signature is passed through, and responses contain a `thoughtSignature` field

**Example response with pass-through enabled**:

```json
{
  "choices": [{
    "delta": {
      "reasoning_content": "Let me think...",
      "thoughtSignature": "RXFRRENrZ0lDaEFD..."
    }
  }]
}
```

### useContextSystemPrompt

Controls whether system messages in the request are merged into SystemInstruction.

```json
{
  "other": {
    "useContextSystemPrompt": false
  }
}
```

- `false` (default): only the global `SYSTEM_INSTRUCTION` environment variable is used
- `true`: the consecutive system messages at the start of the request are merged with the global configuration

## Notes

1. All `/v1/*` requests must carry a valid API key
2. The admin API (`/admin/*`) requires JWT authentication
3. Image input must be Base64-encoded
4. Streaming responses use the Server-Sent Events (SSE) format and include a heartbeat mechanism to prevent timeouts
5. Tool calling requires a model that supports Function Calling
6. Image generation is only supported by the `gemini-3-pro-image` model
7. The model list is cached for 1 hour; this is adjustable in the configuration
8. Chain-of-thought content is returned in the `reasoning_content` field (compatible with the DeepSeek format)
9. The default rotation strategy is `request_count`, switching tokens every 50 requests
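
Notes 4 and 8 above mean that a client reading the raw stream has to separate `reasoning_content` from `content` and tolerate lines that are not data events. The following is a minimal sketch of that handling, not the service's own client code. The helper name `split_stream` is hypothetical, and it assumes heartbeats arrive as SSE comment lines beginning with `:`; this document does not specify the heartbeat format, so verify it against the actual stream.

```python
import json


def split_stream(sse_lines):
    """Collect reasoning text and answer text from SSE `data:` lines.

    Hypothetical helper for illustration; not part of Antigravity2API.
    """
    reasoning, answer = [], []
    for line in sse_lines:
        line = line.strip()
        # Skip blank separators and anything that is not a data event
        # (assumption: heartbeats are SSE comment lines starting with ":").
        if not line.startswith("data:"):
            continue
        payload = line[len("data:"):].strip()
        if payload == "[DONE]":
            break
        chunk = json.loads(payload)
        for choice in chunk.get("choices", []):
            delta = choice.get("delta", {})
            # Either field may be missing or null in a given chunk.
            reasoning.append(delta.get("reasoning_content") or "")
            answer.append(delta.get("content") or "")
    return "".join(reasoning), "".join(answer)


# Sample lines modeled on the streaming format shown in this document.
sample = [
    'data: {"choices":[{"delta":{"reasoning_content":"Let me"}}]}',
    "",
    ": heartbeat",
    'data: {"choices":[{"delta":{"reasoning_content":" think..."}}]}',
    'data: {"choices":[{"delta":{"content":"Quantum entanglement is..."}}]}',
    "data: [DONE]",
]

reasoning_text, answer_text = split_stream(sample)
print(reasoning_text)
print(answer_text)
```

Keeping the two fields in separate buffers lets a UI show the reasoning in a collapsible panel while streaming the answer normally.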