z2api

ZyphrZero committed on Sep 4, 2025

Commit

130b143

1 Parent(s): 1ba1e5f

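✨ feat(models): add GLM-4.5-Air model support

- Add a new AIR_MODEL configuration option for GLM-4.5-Air
- Map the requested model to an upstream model ID dynamically
- Update the model list to include GLM-4.5-Air and the 0727-106B-API upstream ID
- Extend the model architecture to support multiple upstream model mappings

✨ feat(config): add option to skip auth token validation

- Add the SKIP_AUTH_TOKEN environment variable to bypass API key validation
- Update the configuration settings to include the new authentication bypass option
- Change the authentication logic to validate API tokens conditionally

📝 docs(readme): document the new features

- Add a version badge showing version 1.1.0
- List the multi-model architecture among the highlights
- Update the model table with an upstream ID column and the GLM-4.5-Air model
- Document the new environment variables for model configuration
- Add SKIP_AUTH_TOKEN and BACKUP_TOKEN to the configuration table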

Files changed (6)

.env.example +2 -0
README.md +82 -18
app/api/anthropic.py +15 -12
app/api/openai.py +31 -14
app/core/config.py +2 -0
deploy/docker-compose.yml +2 -0

.env.example CHANGED

@@ -8,6 +8,8 @@
 # Client authentication key (shared by OpenAI and Anthropic)
 # Clients must use this key to authenticate when calling the service
 AUTH_TOKEN=sk-your-api-key
+# Whether to skip API key validation
+SKIP_AUTH_TOKEN=false
 # Anthropic API client authentication key (optional)
 # If unset, the value of AUTH_TOKEN is used

README.md CHANGED

@@ -3,6 +3,7 @@
 ![License: MIT](https://img.shields.io/badge/license-MIT-blue.svg)
 ![Python: 3.8+](https://img.shields.io/badge/python-3.8+-green.svg)
 ![FastAPI](https://img.shields.io/badge/framework-FastAPI-009688.svg)
+![Version: 1.1.0](https://img.shields.io/badge/version-1.1.0-brightgreen.svg)
 A lightweight proxy service that exposes OpenAI- and Anthropic-compatible API interfaces for Z.AI, with full support for the GLM-4.5 model family.
@@ -17,6 +18,7 @@
 - 🐳 **Docker deployment** - One-click containerized deployment
 - 🛡️ **Session isolation** - Anonymous mode protects privacy
 - 🔧 **Highly configurable** - Flexible configuration through environment variables
+- 📊 **Multi-model architecture** - Flexible upstream model mapping
 ## 🚀 Quick Start
@@ -101,11 +103,12 @@ docker-compose up -d
 ### Supported Models
-| Model | Description | Features |
-|------|------|------|
-| `GLM-4.5` | Standard model | General conversation, balanced performance |
-| `GLM-4.5-Thinking` | Thinking model | Shows the reasoning process, high transparency |
-| `GLM-4.5-Search` | Search model | Real-time web search, up-to-date information |
+| Model | Upstream ID | Description | Features |
+|------|--------|------|------|
+| `GLM-4.5` | 0727-360B-API | Standard model | General conversation, balanced performance |
+| `GLM-4.5-Thinking` | 0727-360B-API | Thinking model | Shows the reasoning process, high transparency |
+| `GLM-4.5-Search` | 0727-360B-API | Search model | Real-time web search, up-to-date information |
+| `GLM-4.5-Air` | 0727-106B-API | Lightweight model | Fast responses, efficient inference |
 ### Function Call
@@ -161,12 +164,20 @@ for chunk in response:
 | Variable | Default | Description |
 |--------|--------|------|
 | `AUTH_TOKEN` | `sk-your-api-key` | Client authentication key (shared by OpenAI and Anthropic) |
+| `ANTHROPIC_API_KEY` | `sk-your-api-key` | Anthropic API authentication key (defaults to AUTH_TOKEN) |
 | `API_ENDPOINT` | `https://chat.z.ai/api/chat/completions` | Upstream API URL |
 | `LISTEN_PORT` | `8080` | Port the service listens on |
+| `PRIMARY_MODEL` | `GLM-4.5` | Primary model name |
+| `THINKING_MODEL` | `GLM-4.5-Thinking` | Thinking model name |
+| `SEARCH_MODEL` | `GLM-4.5-Search` | Search model name |
+| `AIR_MODEL` | `GLM-4.5-Air` | Air model name |
 | `DEBUG_LOGGING` | `true` | Debug logging switch |
 | `THINKING_PROCESSING` | `think` | Thinking content handling strategy |
 | `ANONYMOUS_MODE` | `true` | Anonymous mode switch |
 | `TOOL_SUPPORT` | `true` | Function Call switch |
+| `SKIP_AUTH_TOKEN` | `false` | Skip auth token validation |
+| `SCAN_LIMIT` | `200000` | Scan limit |
+| `BACKUP_TOKEN` | `eyJhbGciOiJFUzI1NiIsInR5cCI6IkpXVCJ9...` | Backup authentication token |
 ### Thinking Content Handling Strategies
@@ -199,7 +210,7 @@ def chat_with_ai(message):
 ### 2. Multi-model comparison test
 ```python
-models = ["GLM-4.5", "GLM-4.5-Thinking", "GLM-4.5-Search"]
+models = ["GLM-4.5", "GLM-4.5-Thinking", "GLM-4.5-Search", "GLM-4.5-Air"]
 for model in models:
     response = client.chat.completions.create(
@@ -248,26 +259,79 @@ A: Supports chat completions, model listing, streaming responses, tool calls, and other core…
 **Q: Which Anthropic API features are supported?**
 A: Core features such as message creation, streaming responses, and system prompts.
+**Q: How do I choose the right model?**
+A:
+- **GLM-4.5**: General-purpose use, balancing performance and quality
+- **GLM-4.5-Thinking**: When you need to see the reasoning process
+- **GLM-4.5-Search**: When you need up-to-date information
+- **GLM-4.5-Air**: High-concurrency, low-latency workloads
 **Q: How do I customize the configuration?**
 A: Use environment variables; a `.env` file is recommended.
 ## 🏗️ Architecture
 ```
-┌──────────────┐     ┌─────────────┐     ┌─────────────┐
-│   OpenAI     │     │             │     │             │
-│  Client      │────▶│   Proxy     │────▶│    Z.AI     │
-└──────────────┘     │   Server    │     │    API      │
-┌──────────────┐     │             │     │             │
-│  Anthropic    │────▶│             │     │             │
-│  Client      │     │             │     │             │
-└──────────────┘     └─────────────┘     └─────────────┘
+┌──────────────┐      ┌─────────────────────────┐      ┌─────────────────┐
+│   OpenAI     │      │                         │      │                 │
+│  Client      │────▶│    FastAPI Router       │────▶│   Z.AI API      │
+└──────────────┘      │                         │      │                 │
+┌──────────────┐      │ ┌─────────────────────┐ │      │ ┌─────────────┐ │
+│  Anthropic   │────▶│ │   OpenAI Endpoint   │ │      │ │0727-360B-API│ │
+│  Client      │      │ └─────────────────────┘ │      │ └─────────────┘ │
+└──────────────┘      │ ┌─────────────────────┐ │      │ ┌─────────────┐ │
+                      │ │  Anthropic Endpoint │ │────▶│ │0727-106B-API│ │
+                      │ └─────────────────────┘ │      │ └─────────────┘ │
+                      │ ┌─────────────────────┐ │      │                 │
+                      │ │   Models Endpoint   │ │      └─────────────────┘
+                      │ └─────────────────────┘ │
+                      └─────────────────────────┘
+                              Proxy Server
 ```
-- **FastAPI** - High-performance web framework
-- **Pydantic** - Data validation and serialization
-- **Uvicorn** - ASGI server
-- **Requests** - HTTP client
+### Core Components
+- **FastAPI** - High-performance web framework with async support
+- **Pydantic** - Data validation and serialization, ensuring API compatibility
+- **Uvicorn** - ASGI server providing high-performance serving
+- **Requests** - HTTP client for communicating with the upstream API
+### Architecture Highlights
+- **Modular design** - Clear directory structure, easy to maintain and extend
+- **Multi-protocol support** - Supports both the OpenAI and Anthropic API protocols
+- **Dynamic routing** - Selects the upstream service automatically based on the requested model
+- **Streaming** - Full support for SSE streaming responses
+- **Type safety** - Strict type checking based on Pydantic
+### Project Structure
+```
+z.ai2api_python/
+├── app/
+│   ├── api/
+│   │   ├── __init__.py
+│   │   ├── openai.py          # OpenAI API routes
+│   │   └── anthropic.py       # Anthropic API routes
+│   ├── core/
+│   │   ├── __init__.py
+│   │   ├── config.py          # Configuration management
+│   │   └── response_handlers.py  # Response handlers
+│   ├── models/
+│   │   ├── __init__.py
+│   │   └── schemas.py         # Data model definitions
+│   ├── utils/
+│   │   ├── __init__.py
+│   │   ├── helpers.py         # Utility functions
+│   │   ├── tools.py           # Function Call handling
+│   │   └── sse_parser.py      # SSE parser
+│   └── __init__.py
+├── tests/                     # Test files
+├── deploy/                    # Deployment configuration
+├── main.py                    # Application entry point
+├── requirements.txt           # Dependency list
+└── README.md                  # Project documentation
+```
 ## 🤝 Contributing

app/api/anthropic.py CHANGED

@@ -115,18 +115,21 @@ async def handle_anthropic_message(
     """Handle Anthropic message requests"""
     debug_log("Received Anthropic message request")
-    # Validate API key
-    api_key = None
-    if x_api_key:
-        api_key = x_api_key
-    elif authorization and authorization.startswith("Bearer "):
-        api_key = authorization[7:]
-    if not api_key or api_key != settings.ANTHROPIC_API_KEY:
-        debug_log(f"Invalid API key: {api_key}")
-        raise HTTPException(status_code=401, detail="Invalid API key")
-    debug_log(f"API key validated")
+    # Validate API key (skip if SKIP_AUTH_TOKEN is enabled)
+    if not settings.SKIP_AUTH_TOKEN:
+        api_key = None
+        if x_api_key:
+            api_key = x_api_key
+        elif authorization and authorization.startswith("Bearer "):
+            api_key = authorization[7:]
+        if not api_key or api_key != settings.ANTHROPIC_API_KEY:
+            debug_log(f"Invalid API key: {api_key}")
+            raise HTTPException(status_code=401, detail="Invalid API key")
+        debug_log(f"API key validated")
+    else:
+        debug_log("SKIP_AUTH_TOKEN is enabled, skipping API key validation")
     debug_log(f"Request parsed - model: {req.model}, stream: {req.stream}, messages: {len(req.messages)}")
     # Determine upstream model and features
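
The conditional check in this hunk can be read as two steps: pick the client key from whichever header carries it, then compare it against the configured key unless the bypass flag is set. Below is a minimal, framework-free sketch of that flow. The helper names `extract_api_key` and `is_authorized` are hypothetical and do not appear in the commit; the real handler raises `HTTPException(status_code=401)` instead of returning a boolean.

```python
from typing import Optional


def extract_api_key(x_api_key: Optional[str], authorization: Optional[str]) -> Optional[str]:
    """Prefer the x-api-key header, else fall back to a Bearer Authorization header."""
    if x_api_key:
        return x_api_key
    if authorization and authorization.startswith("Bearer "):
        return authorization[len("Bearer "):]
    return None


def is_authorized(
    x_api_key: Optional[str],
    authorization: Optional[str],
    expected_key: str,
    skip_auth: bool,
) -> bool:
    """Hypothetical helper mirroring the diff: skip the check when skip_auth is set."""
    if skip_auth:
        return True
    api_key = extract_api_key(x_api_key, authorization)
    # An empty or missing key is rejected, as in `not api_key or api_key != ...`.
    return bool(api_key) and api_key == expected_key
```

With `skip_auth=True` every request is accepted regardless of headers, so the flag is presumably meant for trusted or local deployments only.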

app/api/openai.py CHANGED

@@ -41,6 +41,11 @@ async def list_models():
                 created=current_time,
                 owned_by="z.ai"
             ),
+            Model(
+                id=settings.AIR_MODEL,
+                created=current_time,
+                owned_by="z.ai"
+            ),
         ]
     )
     return response
@@ -55,17 +60,20 @@ async def chat_completions(
     debug_log("Received chat completions request")
     try:
-        # Validate API key
-        if not authorization.startswith("Bearer "):
-            debug_log("Missing or invalid Authorization header")
-            raise HTTPException(status_code=401, detail="Missing or invalid Authorization header")
-        api_key = authorization[7:]
-        if api_key != settings.AUTH_TOKEN:
-            debug_log(f"Invalid API key: {api_key}")
-            raise HTTPException(status_code=401, detail="Invalid API key")
-        debug_log(f"API key validated, AUTH_TOKEN={api_key[:8]}......")
+        # Validate API key (skip if SKIP_AUTH_TOKEN is enabled)
+        if not settings.SKIP_AUTH_TOKEN:
+            if not authorization.startswith("Bearer "):
+                debug_log("Missing or invalid Authorization header")
+                raise HTTPException(status_code=401, detail="Missing or invalid Authorization header")
+            api_key = authorization[7:]
+            if api_key != settings.AUTH_TOKEN:
+                debug_log(f"Invalid API key: {api_key}")
+                raise HTTPException(status_code=401, detail="Invalid API key")
+            debug_log(f"API key validated, AUTH_TOKEN={api_key[:8]}......")
+        else:
+            debug_log("SKIP_AUTH_TOKEN is enabled, skipping API key validation")
         debug_log(f"Request parsed - model: {request.model}, stream: {request.stream}, messages: {len(request.messages)}")
         # Generate IDs
@@ -95,14 +103,23 @@ async def chat_completions(
         # Determine model features
         is_thinking = request.model == settings.THINKING_MODEL
         is_search = request.model == settings.SEARCH_MODEL
+        is_air = request.model == settings.AIR_MODEL
         search_mcp = "deep-web-search" if is_search else ""
+        # Determine upstream model ID based on requested model
+        if is_air:
+            upstream_model_id = "0727-106B-API"  # AIR model upstream ID
+            upstream_model_name = "GLM-4.5-Air"
+        else:
+            upstream_model_id = "0727-360B-API"  # Default upstream model ID
+            upstream_model_name = "GLM-4.5"
         # Build upstream request
         upstream_req = UpstreamRequest(
             stream=True,  # Always use streaming from upstream
             chat_id=chat_id,
             id=msg_id,
-            model="0727-360B-API",  # Actual upstream model ID
+            model=upstream_model_id,  # Dynamic upstream model ID
             messages=upstream_messages,
             params={},
             features={
@@ -116,8 +133,8 @@ async def chat_completions(
             },
             mcp_servers=[search_mcp] if search_mcp else [],
             model_item=ModelItem(
-                id="0727-360B-API",
-                name="GLM-4.5",
+                id=upstream_model_id,
+                name=upstream_model_name,
                 owned_by="openai"
             ),
             tool_servers=[],
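
The `if is_air / else` branch in this hunk maps the requested model name to an upstream ID and display name. If more upstream models were added later, the same mapping could be expressed as a lookup table. The sketch below is one possible shape, not the commit's implementation: `UPSTREAM_BY_MODEL`, `DEFAULT_UPSTREAM`, and `resolve_upstream` are hypothetical names, and the key is hard-coded to the default `AIR_MODEL` value rather than read from `settings`.

```python
from typing import Dict, Tuple

# (upstream model ID, upstream display name), taken from the values in the diff.
DEFAULT_UPSTREAM: Tuple[str, str] = ("0727-360B-API", "GLM-4.5")

# Assumption: keyed by the default AIR_MODEL name; the commit compares against
# settings.AIR_MODEL, which can be overridden through the environment.
UPSTREAM_BY_MODEL: Dict[str, Tuple[str, str]] = {
    "GLM-4.5-Air": ("0727-106B-API", "GLM-4.5-Air"),
}


def resolve_upstream(requested_model: str) -> Tuple[str, str]:
    """Return (upstream_model_id, upstream_model_name) for a requested model."""
    return UPSTREAM_BY_MODEL.get(requested_model, DEFAULT_UPSTREAM)


upstream_model_id, upstream_model_name = resolve_upstream("GLM-4.5-Air")
```

As in the diff, any model other than the Air model (including the Thinking and Search variants) falls through to the default `0727-360B-API` upstream.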

app/core/config.py CHANGED

@@ -20,6 +20,7 @@ class Settings(BaseSettings):
     PRIMARY_MODEL: str = os.getenv("PRIMARY_MODEL", "GLM-4.5")
     THINKING_MODEL: str = os.getenv("THINKING_MODEL", "GLM-4.5-Thinking")
     SEARCH_MODEL: str = os.getenv("SEARCH_MODEL", "GLM-4.5-Search")
+    AIR_MODEL: str = os.getenv("AIR_MODEL", "GLM-4.5-Air")
     # Server Configuration
     LISTEN_PORT: int = int(os.getenv("LISTEN_PORT", "8080"))
@@ -30,6 +31,7 @@ class Settings(BaseSettings):
     ANONYMOUS_MODE: bool = os.getenv("ANONYMOUS_MODE", "true").lower() == "true"
     TOOL_SUPPORT: bool = os.getenv("TOOL_SUPPORT", "true").lower() == "true"
     SCAN_LIMIT: int = int(os.getenv("SCAN_LIMIT", "200000"))
+    SKIP_AUTH_TOKEN: bool = os.getenv("SKIP_AUTH_TOKEN", "false").lower() == "true"
     # Browser Headers
     CLIENT_HEADERS: Dict[str, str] = {
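
The boolean settings above all use the expression `os.getenv(NAME, default).lower() == "true"`. One consequence worth spelling out is that only the string `true` (in any letter case) enables a flag; values such as `1` or `yes` leave it disabled. The sketch below isolates that expression in a hypothetical helper, `env_flag`, which takes the environment as a parameter so the behaviour can be checked without touching the real process environment.

```python
import os
from typing import Mapping


def env_flag(name: str, default: str = "false", environ: Mapping[str, str] = os.environ) -> bool:
    """Hypothetical helper: same comparison as the Settings fields in the diff."""
    return environ.get(name, default).lower() == "true"


# Only a case-insensitive "true" turns the flag on.
enabled = env_flag("SKIP_AUTH_TOKEN", environ={"SKIP_AUTH_TOKEN": "True"})
still_disabled = env_flag("SKIP_AUTH_TOKEN", environ={"SKIP_AUTH_TOKEN": "1"})
unset_uses_default = env_flag("SKIP_AUTH_TOKEN", environ={})
```

Here `enabled` is `True`, while `still_disabled` and `unset_uses_default` are both `False`, so leaving `SKIP_AUTH_TOKEN` unset keeps API key validation on.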

deploy/docker-compose.yml CHANGED

@@ -11,6 +11,8 @@ services:
     environment:
       # Auth Configuration
       - AUTH_TOKEN=sk-your-api-key
+      # Whether to skip API key validation
+      - SKIP_AUTH_TOKEN=false
       # Server Configurations
       - DEBUG_LOGGING=true
       # Feature Configuration