z2api

Sleeping

App Files Files Community

zhaoxiaozhao07 commited on Oct 1, 2025

Commit

9f61557

1 Parent(s): 99ab6ae

feat(core): 更新环境配置示例和文档以支持 GLM-4.6 模型，增强 Token 管理逻辑,可以直接.env中以”，“分割放置多个

Browse files

Files changed (3) hide show

.env.example +3 -3
README.md +99 -5
app/core/token_manager.py +54 -23

.env.example CHANGED Viewed

@@ -11,16 +11,16 @@ AUTH_TOKEN=sk-your-api-key
 # 跳过客户端认证（仅开发环境使用）
 SKIP_AUTH_TOKEN=false
-# Z.ai 备用访问令牌（当匿名模式失败时使用）
 # 注意：这是用于访问 Z.ai 服务的令牌，不是客户端认证密钥
-BACKUP_TOKEN=eyJhbG.....iyDqkjPGsaiQ
 # ========== 服务器配置 ==========
 # 服务监听端口
 LISTEN_PORT=8080
 # 调试日志开关
-DEBUG_LOGGING=true
 # ========== 功能配置 ==========
 # 思考内容处理策略

 # 跳过客户端认证（仅开发环境使用）
 SKIP_AUTH_TOKEN=false
+# Z.ai 备用访问令牌（当匿名模式失败时使用）,可以以','分隔多个令牌,也可以放入到tokens.txt文件中
 # 注意：这是用于访问 Z.ai 服务的令牌，不是客户端认证密钥
+BACKUP_TOKEN=token_1,token_2,token_3....
 # ========== 服务器配置 ==========
 # 服务监听端口
 LISTEN_PORT=8080
 # 调试日志开关
+DEBUG_LOGGING=false
 # ========== 功能配置 ==========
 # 思考内容处理策略

README.md CHANGED Viewed

@@ -5,7 +5,7 @@
 ![FastAPI](https://img.shields.io/badge/framework-FastAPI-009688.svg)
 ![Version: 1.2.0](https://img.shields.io/badge/version-1.2.0-brightgreen.svg)
-轻量级 OpenAI API 兼容代理服务，通过 Claude Code Router 接入 Z.AI，支持 GLM-4.5 系列模型的完整功能。
 ## ✨ 核心特性
@@ -109,7 +109,7 @@ tools = [{
 # 使用工具
 response = client.chat.completions.create(
-    model="GLM-4.5",
     messages=[{"role": "user", "content": "北京天气怎么样？"}],
     tools=tools,
     tool_choice="auto"
@@ -120,7 +120,7 @@ response = client.chat.completions.create(
 ```python
 response = client.chat.completions.create(
-    model="GLM-4.5-Thinking",
     messages=[{"role": "user", "content": "解释量子计算"}],
     stream=True
 )
@@ -150,7 +150,10 @@ for chunk in response:
 | `TOOL_SUPPORT`        | `true`                                    | Function Call 功能开关 |
 | `SKIP_AUTH_TOKEN`     | `false`                                   | 跳过认证令牌验证       |
 | `SCAN_LIMIT`          | `200000`                                  | 扫描限制               |
-| `BACKUP_TOKEN`        | `eyJhbGciOiJFUzI1NiIsInR5cCI6IkpXVCJ9...` | Z.ai 固定访问令牌      |
 ### 思考内容处理策略
@@ -158,6 +161,94 @@ for chunk in response:
 - `strip` - 移除思考内容
 - `raw` - 保留原始格式
 ## 🎯 使用场景
 ### 1. AI 应用开发
@@ -174,7 +265,7 @@ client = OpenAI(
 # 智能客服
 def chat_with_ai(message):
     response = client.chat.completions.create(
-        model="GLM-4.5",
         messages=[{"role": "user", "content": message}]
     )
     return response.choices[0].message.content
@@ -279,6 +370,9 @@ A:
 - **GLM-4.5-Thinking**: 需要了解推理过程的场��
 - **GLM-4.5-Search**: 需要实时信息的场景
 - **GLM-4.5-Air**: 高并发、低延迟要求的场景
 **Q: 如何自定义配置？**
 A: 通过环境变量配置，推荐使用 `.env` 文件。

 ![FastAPI](https://img.shields.io/badge/framework-FastAPI-009688.svg)
 ![Version: 1.2.0](https://img.shields.io/badge/version-1.2.0-brightgreen.svg)
+轻量级 OpenAI API 兼容代理服务，通过 Claude Code Router 接入 Z.AI，支持 GLM-4.6 系列模型的完整功能。
 ## ✨ 核心特性
 # 使用工具
 response = client.chat.completions.create(
+    model="GLM-4.6",
     messages=[{"role": "user", "content": "北京天气怎么样？"}],
     tools=tools,
     tool_choice="auto"
 ```python
 response = client.chat.completions.create(
+    model="GLM-4.6-Thinking",
     messages=[{"role": "user", "content": "解释量子计算"}],
     stream=True
 )
 | `TOOL_SUPPORT`        | `true`                                    | Function Call 功能开关 |
 | `SKIP_AUTH_TOKEN`     | `false`                                   | 跳过认证令牌验证       |
 | `SCAN_LIMIT`          | `200000`                                  | 扫描限制               |
+| `BACKUP_TOKEN`        | `eyJhbGciO...`                            | 固定访问令牌，多个以','分隔|
+| `TOKEN_FILE_PATH`     | `./tokens.txt`                            | Token文件路径          |
+| `TOKEN_MAX_FAILURES`  | `3`                                       | Token最大失败次数      |
+| `TOKEN_RELOAD_INTERVAL`| `60`                                     | Token重载间隔(秒)      |
 ### 思考内容处理策略
 - `strip` - 移除思考内容
 - `raw` - 保留原始格式
+## 🔑 Token 轮询管理
+系统支持智能 Token 轮询管理，可以在多个 Token 之间自动切换，实现负载均衡和容错处理。
+### Token 来源
+系统按以下优先级加载 Token：
+1. **tokens.txt 文件** - 在项目根目录创建 `tokens.txt` 文件，每行一个 Token
+2. **BACKUP_TOKEN 环境变量** - 支持多个 Token，以逗号分隔
+### tokens.txt 文件格式
+```
+# 这是注释，会被忽略
+sk-your-first-token-here
+sk-your-second-token-here
+sk-your-third-token-here
+```
+### BACKUP_TOKEN 环境变量格式
+```bash
+# 单个 Token
+BACKUP_TOKEN=sk-your-token-here
+# 多个 Token（以逗号分隔）
+BACKUP_TOKEN=sk-first-token,sk-second-token,sk-third-token
+```
+### Token 轮询机制
+- **轮询策略**：采用轮询（Round-Robin）算法，依次使用每个可用 Token
+- **失败处理**：当 Token 失败时，系统会标记失败次数，达到最大失败次数后自动禁用
+- **自动恢复**：禁用的 Token 会在重新加载时重置状态
+- **去重机制**：自动去除重复的 Token，确保每个 Token 只使用一次
+- **状态保持**：保留已有 Token 的失败计数和使用状态
+### Token 配置参数
+| 参数                 | 默认值 | 说明                         |
+| -------------------- | ------ | ---------------------------- |
+| `TOKEN_FILE_PATH`    | `./tokens.txt` | Token 文件路径               |
+| `TOKEN_MAX_FAILURES` | `3`    | Token 最大失败次数           |
+| `TOKEN_RELOAD_INTERVAL` | `60`  | Token 重载间隔（秒）         |
+### 使用示例
+#### 1. 仅使用 tokens.txt
+创建 `tokens.txt` 文件：
+```
+sk-token-1
+sk-token-2
+sk-token-3
+```
+#### 2. 仅使用 BACKUP_TOKEN
+在 `.env` 文件中配置：
+```env
+BACKUP_TOKEN=sk-token-1,sk-token-2,sk-token-3
+```
+#### 3. 同时使用 tokens.txt 和 BACKUP_TOKEN
+系统会合并两个来源的 Token，自动去重：
+- `tokens.txt` 包含：`sk-token-1`, `sk-token-2`
+- `BACKUP_TOKEN` 包含：`sk-token-2`, `sk-token-3`
+- 最终 Token 池：`sk-token-1`, `sk-token-2`, `sk-token-3`
+### Token 状态监控
+系统提供了 Token 状态统计接口，可以查看：
+- Token 总数
+- 活跃 Token 数量
+- 失败 Token 数量
+- 每个 Token 的详细信息（预览、状态、失败次数等）
+### 最佳实践
+1. **Token 分散**：将 Token 分散存储在 `tokens.txt` 和 `BACKUP_TOKEN` 中
+2. **定期更新**：定期检查 Token 有效性，及时替换失效的 Token
+3. **监控状态**：关注 Token 失败情况，及时调整配置
+4. **合理设置**：根据 API 调用频率调整 `TOKEN_MAX_FAILURES` 和 `TOKEN_RELOAD_INTERVAL`
 ## 🎯 使用场景
 ### 1. AI 应用开发
 # 智能客服
 def chat_with_ai(message):
     response = client.chat.completions.create(
+        model="GLM-4.6",
         messages=[{"role": "user", "content": message}]
     )
     return response.choices[0].message.content
 - **GLM-4.5-Thinking**: 需要了解推理过程的场��
 - **GLM-4.5-Search**: 需要实时信息的场景
 - **GLM-4.5-Air**: 高并发、低延迟要求的场景
+- **GLM-4.6**: 最新模型，性能和效果最佳
+- **GLM-4.6-Thinking**: 模型推理过程
 **Q: 如何自定义配置？**
 A: 通过环境变量配置，推荐使用 `.env` 文件。

app/core/token_manager.py CHANGED Viewed

@@ -60,31 +60,62 @@ class TokenManager:
     def _load_tokens(self) -> None:
         """Load tokens from file"""
         try:
-            if not os.path.exists(self.token_file_path):
-                debug_log(f"Token文件不存在: {self.token_file_path}")
-                # Fallback to BACKUP_TOKEN if file doesn't exist
                 try:
                     from app.core.config import settings
                     if hasattr(settings, 'BACKUP_TOKEN') and settings.BACKUP_TOKEN:
-                        self.tokens = [TokenInfo(token=settings.BACKUP_TOKEN)]
-                        debug_log("使用配置文件中的BACKUP_TOKEN作为备用")
                 except ImportError:
                     pass
-                return
-            with open(self.token_file_path, 'r', encoding='utf-8') as f:
-                lines = f.readlines()
-            new_tokens = []
-            for line in lines:
-                token = line.strip()
-                if token and not token.startswith('#'):  # Skip empty lines and comments
-                    # Check if this token already exists to preserve failure count
-                    existing_token = next((t for t in self.tokens if t.token == token), None)
-                    if existing_token:
-                        new_tokens.append(existing_token)
-                    else:
-                        new_tokens.append(TokenInfo(token=token))
             if new_tokens:
                 with self._lock:
@@ -94,14 +125,14 @@ class TokenManager:
                         self.current_index = 0
                     self.last_reload_time = time.time()
-                debug_log(f"成功加载 {len(self.tokens)} 个token")
                 active_count = sum(1 for t in self.tokens if t.is_active)
                 debug_log(f"活跃token数量: {active_count}")
             else:
-                debug_log("Token文件为空或无有效token")
         except Exception as e:
-            debug_log(f"加载token文件失败: {e}")
     def _should_reload(self) -> bool:
         """Check if tokens should be reloaded"""

     def _load_tokens(self) -> None:
         """Load tokens from file"""
         try:
+            new_tokens = []
+            # 首先尝试从tokens.txt文件加载token
+            if os.path.exists(self.token_file_path):
+                with open(self.token_file_path, 'r', encoding='utf-8') as f:
+                    lines = f.readlines()
+                for line in lines:
+                    token = line.strip()
+                    if token and not token.startswith('#'):  # Skip empty lines and comments
+                        # Check if this token already exists to preserve failure count
+                        existing_token = next((t for t in self.tokens if t.token == token), None)
+                        if existing_token:
+                            new_tokens.append(existing_token)
+                        else:
+                            new_tokens.append(TokenInfo(token=token))
+                if new_tokens:
+                    debug_log(f"从tokens.txt文件加载了 {len(new_tokens)} 个token")
+                else:
+                    debug_log("Token文件为空或无有效token")
+            # 然后尝试从BACKUP_TOKEN环境变量加载token
+            try:
+                from app.core.config import settings
+                if hasattr(settings, 'BACKUP_TOKEN') and settings.BACKUP_TOKEN:
+                    # 支持多个BACKUP_TOKEN值，以逗号分隔
+                    backup_tokens = [token.strip() for token in settings.BACKUP_TOKEN.split(',') if token.strip()]
+                    # 添加不重复的backup token
+                    for backup_token in backup_tokens:
+                        # 检查是否已经存在相同的token
+                        existing_token = next((t for t in new_tokens if t.token == backup_token), None)
+                        if not existing_token:
+                            # 检查是否在原有tokens中存在，以保留失败计数
+                            old_token = next((t for t in self.tokens if t.token == backup_token), None)
+                            if old_token:
+                                new_tokens.append(old_token)
+                            else:
+                                new_tokens.append(TokenInfo(token=backup_token))
+                    debug_log(f"从BACKUP_TOKEN加载了 {len(backup_tokens)} 个token")
+            except ImportError:
+                pass
+            # 如果没有任何token，尝试仅使用BACKUP_TOKEN
+            if not new_tokens:
                 try:
                     from app.core.config import settings
                     if hasattr(settings, 'BACKUP_TOKEN') and settings.BACKUP_TOKEN:
+                        # 支持多个BACKUP_TOKEN值，以逗号分隔
+                        backup_tokens = [token.strip() for token in settings.BACKUP_TOKEN.split(',') if token.strip()]
+                        new_tokens = [TokenInfo(token=token) for token in backup_tokens]
+                        debug_log(f"仅使用BACKUP_TOKEN，共{len(backup_tokens)}个token")
                 except ImportError:
                     pass
             if new_tokens:
                 with self._lock:
                         self.current_index = 0
                     self.last_reload_time = time.time()
+                debug_log(f"总共加载了 {len(self.tokens)} 个token")
                 active_count = sum(1 for t in self.tokens if t.is_active)
                 debug_log(f"活跃token数量: {active_count}")
             else:
+                debug_log("没有找到任何可用的token")
         except Exception as e:
+            debug_log(f"加载token失败: {e}")
     def _should_reload(self) -> bool:
         """Check if tokens should be reloaded"""