File size: 18,199 Bytes
5c5b371 |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 116 117 118 119 120 121 122 123 124 125 126 127 128 129 130 131 132 133 134 135 136 137 138 139 140 141 142 143 144 145 146 147 148 149 150 151 152 153 154 155 156 157 158 159 160 161 162 163 164 165 166 167 168 169 170 171 172 173 174 175 176 177 178 179 180 181 182 183 184 185 186 187 188 189 190 191 192 193 194 195 196 197 198 199 200 201 202 203 204 205 206 207 208 209 210 211 212 213 214 215 216 217 218 219 220 221 222 223 224 225 226 227 228 229 230 231 232 233 234 235 236 237 238 239 240 241 242 243 244 245 246 247 248 249 250 251 252 253 254 255 256 257 258 259 260 261 262 263 264 265 266 267 268 269 270 271 272 273 274 275 276 277 278 279 280 281 282 283 284 285 286 287 288 289 290 291 292 293 294 295 296 297 298 299 300 301 302 303 304 305 306 307 308 309 310 311 312 313 314 315 316 317 318 319 320 321 322 |
# Project Codebase Guide
This document serves as a guide and index for the project codebase, designed to help developers and AI agents quickly understand its structure, components, and how to contribute.
## Table of Contents
1. [Project Overview](#project-overview)
2. [Directory Structure](#directory-structure)
3. [Core Components](#core-components)
* [Configuration (`src/config.ts`)](#configuration)
* [Server Entry Point (`src/server.ts`)](#server-entry-point)
* [Proxy Layer (`src/proxy/`)](#proxy-layer)
* [User Management (`src/user/`)](#user-management)
* [Admin Interface (`src/admin/`)](#admin-interface)
* [Shared Utilities (`src/shared/`)](#shared-utilities)
4. [Proxy Functionality](#proxy-functionality)
* [Routing (`src/proxy/routes.ts`)](#proxy-routing)
* [Supported Models & Providers](#supported-models--providers)
* [Middleware (`src/proxy/middleware/`)](#proxy-middleware)
* [Adding New Models](#adding-new-models)
* [Adding New APIs/Providers](#adding-new-apisproviders)
5. [Model Management](#model-management)
* [Model Family Definitions](#model-family-definitions)
* [Adding OpenAI Models](#adding-openai-models)
* [Model Mapping & Routing](#model-mapping--routing)
* [Service Information](#service-information)
* [Step-by-Step Guide for Adding a New Model](#step-by-step-guide-for-adding-a-new-model)
* [Model Patterns and Versioning](#model-patterns-and-versioning)
* [Response Format Handling](#response-format-handling)
6. [Key Management](#key-management)
* [Key Pool System](#key-pool-system)
* [Provider-Specific Key Management](#provider-specific-key-management)
* [Key Rotation and Health Checks](#key-rotation-and-health-checks)
7. [Data Management](#data-management)
* [Database (`src/shared/database/`)](#database)
* [File Storage (`src/shared/file-storage/`)](#file-storage)
8. [Authentication & Authorization](#authentication--authorization)
9. [Logging & Monitoring](#logging--monitoring)
10. [Deployment](#deployment)
11. [Contributing](#contributing)
## Project Overview
This project provides a proxy layer for various Large Language Models (LLMs) and potentially other AI APIs. It aims to offer a unified interface, manage API keys securely, handle rate limiting, usage tracking, and potentially add features like response caching or prompt modification.
## Directory Structure
```
.
βββ .env.example # Example environment variables
βββ .gitattributes # Git attributes
βββ .gitignore # Git ignore rules
βββ .husky/ # Git hooks
βββ .prettierrc # Code formatting rules
βββ CODEBASE_GUIDE.md # This file
βββ README.md # Project README
βββ data/ # Data files (e.g., SQLite DB)
βββ docker/ # Docker configuration
βββ docs/ # Documentation files
βββ http-client.env.json # HTTP client environment
βββ package-lock.json # NPM lock file
βββ package.json # Project dependencies and scripts
βββ patches/ # Patches for dependencies
βββ public/ # Static assets served by the web server
βββ render.yaml # Render deployment configuration
βββ scripts/ # Utility scripts
βββ src/ # Source code
β βββ admin/ # Admin interface logic
β βββ config.ts # Application configuration
β βββ info-page.ts # Logic for the info page
β βββ logger.ts # Logging setup
β βββ proxy/ # Core proxy logic for different providers
β βββ server.ts # Express server setup and main entry point
β βββ service-info.ts # Service information logic
β βββ shared/ # Shared utilities, types, and modules
β βββ user/ # User management logic
βββ tsconfig.json # TypeScript configuration
```
## Core Components
### Configuration (`src/config.ts`)
* Loads environment variables and defines application settings.
* Contains configuration for database connections, API keys (placeholders/retrieval methods), logging levels, rate limits, etc.
* Uses `dotenv` and potentially a schema validation library (like Zod) to ensure required variables are present.
### Server Entry Point (`src/server.ts`)
* Initializes the Express application.
* Sets up core middleware (e.g., body parsing, CORS, logging).
* Mounts routers for different parts of the application (admin, user, proxy).
* Starts the HTTP server.
### Proxy Layer (`src/proxy/`)
* The heart of the application, handling requests to downstream AI APIs.
* Contains individual modules for each supported provider (e.g., `openai.ts`, `anthropic.ts`).
* Handles request transformation, authentication against the target API, and response handling.
* Uses middleware for common proxy tasks.
### User Management (`src/user/`)
* Handles user registration, login, session management, and potentially API key generation/management for end-users.
* Likely interacts with the database (`src/shared/database/`).
### Admin Interface (`src/admin/`)
* Provides an interface for administrators to manage users, monitor usage, configure settings, etc.
* May have its own set of routes and views.
### Shared Utilities (`src/shared/`)
* Contains reusable code across different modules.
* `api-schemas/`: Zod schemas for API request/response validation.
* `database/`: Database connection, schemas (e.g., Prisma), and query logic.
* `errors.ts`: Custom error classes.
* `key-management/`: Logic for managing API keys (if applicable).
* `models.ts`: Core data models/types used throughout the application.
* `prompt-logging/`: Logic for logging prompts and responses.
* `tokenization/`: Utilities for counting tokens.
* `utils.ts`: General utility functions.
## Proxy Functionality
### Proxy Routing (`src/proxy/routes.ts`)
* Defines the API endpoints for the proxy service (e.g., `/v1/chat/completions`).
* Maps incoming requests to the appropriate provider-specific handler based on the request path, headers, or body content (e.g., model requested).
* Applies relevant middleware (authentication, rate limiting, queuing, etc.).
### Supported Models & Providers
* **OpenAI:** Handled in `src/proxy/openai.ts`. Supports models like GPT-4, GPT-3.5-turbo, as well as o-series models (o1, o1-mini, o1-pro, o3, o3-mini, o3-pro, o4-mini). Handles chat completions and potentially image generation (`src/proxy/openai-image.ts`).
* **Anthropic:** Handled in `src/proxy/anthropic.ts`. Supports Claude models. May use AWS Bedrock (`src/proxy/aws-claude.ts`) or Anthropic's direct API.
* **Google AI / Vertex AI:** Handled in `src/proxy/google-ai.ts` and `src/proxy/gcp.ts`. Supports Gemini models (gemini-flash, gemini-pro, gemini-ultra).
* **Mistral AI:** Handled in `src/proxy/mistral-ai.ts`. Supports Mistral models via their API or potentially AWS (`src/proxy/aws-mistral.ts`).
* **Azure OpenAI:** Handled in `src/proxy/azure.ts`. Provides an alternative endpoint for OpenAI models via Azure.
* **Deepseek:** Handled in `src/proxy/deepseek.ts`.
* **Xai:** Handled in `src/proxy/xai.ts`.
* **AWS (General):** `src/proxy/aws.ts` might contain shared AWS logic (e.g., authentication).
### Middleware (`src/proxy/middleware/`)
* **`gatekeeper.ts`:** Likely handles initial request validation, authentication, and authorization checks before hitting provider logic. Checks origin (`check-origin.ts`), potentially custom tokens (`check-risu-token.ts`).
* **`rate-limit.ts`:** Implements rate limiting logic, potentially per-user or per-key.
* **`queue.ts`:** Manages request queuing, possibly to handle concurrency limits or prioritize requests.
### Adding New Models
1. **Identify the Provider:** Determine if the new model belongs to an existing provider (e.g., a new OpenAI model) or a new one.
2. **Update Provider Logic (if existing):**
* Modify the relevant provider file (e.g., `src/proxy/openai.ts`).
* Update model lists or logic that selects/validates models.
* Adjust any request/response transformations if the new model has a different API schema.
* Update model information in shared files like `src/shared/models.ts` if necessary.
3. **Update Routing (if necessary):** Modify `src/proxy/routes.ts` if the new model requires a different endpoint or routing logic.
4. **Configuration:** Add any new API keys or configuration parameters to `.env.example` and `src/config.ts`.
5. **Testing:** Add unit or integration tests for the new model.
### Adding New APIs/Providers
1. **Create Provider Module:** Create a new file in `src/proxy/` (e.g., `src/proxy/new-provider.ts`).
2. **Implement Handler:**
* Write the core logic to handle requests for this provider. This typically involves:
* Receiving the standardized request from the router.
* Transforming the request into the format expected by the new provider's API.
* Authenticating with the new provider's API (fetching keys from config).
* Making the API call (consider using a robust HTTP client like `axios` or `node-fetch`).
* Handling streaming responses if applicable (using helpers from `src/shared/streaming.ts`).
* Transforming the provider's response back into a standardized format.
* Handling errors gracefully.
3. **Add Routing:**
* Import the new handler in `src/proxy/routes.ts`.
* Add new routes or modify existing routing logic to direct requests to the new handler based on model name, path, or other criteria.
* Apply necessary middleware (gatekeeper, rate limiter, queue).
4. **Create Key Management:**
* Create a new directory in `src/shared/key-management/` for the provider.
* Implement provider-specific key management (key checkers, token counters).
5. **Configuration:**
* Add configuration variables (API keys, base URLs) to `.env.example` and `src/config.ts`.
* Update `src/config.ts` to load and validate the new variables.
6. **Model Information:** Add details about the new provider and its models to `src/shared/models.ts` or similar shared locations.
7. **Tokenization (if applicable):** If token counting is needed, add or update tokenization logic in `src/shared/tokenization/`.
8. **Testing:** Implement thorough tests for the new provider integration.
9. **Documentation:** Update this guide and any other relevant documentation.
## Model Management
### Model Family Definitions
* **Model Family Definitions:** The project uses a family-based approach to group similar models together. These are defined in `src/shared/models.ts`.
* Each model is part of a model family (e.g., "gpt4", "claude", "gemini-pro") which helps with routing, key management, and feature support.
* The `MODEL_FAMILIES` array contains all supported model families, and the `MODEL_FAMILY_SERVICE` mapping connects each family to its provider service.
### Adding OpenAI Models
When adding new OpenAI models to the codebase, there are several files that must be updated:
1. **Update Model Types (`src/shared/models.ts`):**
- Add the new model to the `OpenAIModelFamily` type
- Add the model to the `MODEL_FAMILIES` array
- Add the Azure variants for the model if applicable
- Add the model to `MODEL_FAMILY_SERVICE` mapping
- Update `OPENAI_MODEL_FAMILY_MAP` with regex patterns to match the model names
2. **Update Context Size Limits (`src/proxy/middleware/request/preprocessors/validate-context-size.ts`):**
- Add regex matching for the new model
- Set the appropriate context token limit for the model
3. **Update Token Cost Tracking (`src/shared/stats.ts`):**
- Add pricing information for the new model in the `getTokenCostUsd` function
- Include both input and output prices in the comments for clarity
4. **Update Feature Support Checks (`src/proxy/openai.ts`):**
- If the model supports special features like the reasoning API parameter (`isO1Model` function), update the appropriate function
- For model feature detection, prefer using regex patterns over explicit lists when possible, as this handles date-stamped versions better
5. **Update Display Names (`src/info-page.ts`):**
- Add friendly display names for the new models in the `MODEL_FAMILY_FRIENDLY_NAME` object
6. **Update Key Management Provider Files:**
- For OpenAI keys in `src/shared/key-management/openai/provider.ts`, add token counters for the new models
- For Azure OpenAI keys in `src/shared/key-management/azure/provider.ts`, add token counters for the Azure versions
### Model Patterns and Versioning
The codebase handles several patterns for model naming and versioning:
1. **Date-stamped Models:** Many models include date stamps (e.g., `gpt-4-0125-preview`). The regex patterns in `OPENAI_MODEL_FAMILY_MAP` account for these with patterns like `^gpt-4o(-\\d{4}-\\d{2}-\\d{2})?$`.
2. **O-Series Models:** OpenAI's o-series models (o1, o1-mini, o1-pro, o3, o3-mini, o3-pro, o4-mini) follow a different naming convention. The codebase handles these with dedicated model families and regex patterns.
3. **Preview/Non-Preview Variants:** Some models have preview variants (e.g., `gpt-4.5-preview`). The regex patterns in `OPENAI_MODEL_FAMILY_MAP` account for these with patterns like `^gpt-4\\.5(-preview)?(-\\d{4}-\\d{2}-\\d{2})?$`.
When adding new models, try to follow the existing patterns for consistency.
### Response Format Handling
The codebase includes special handling for different API response formats:
1. **Chat vs. Text Completions:** There's transformation logic in `openai.ts` to convert between chat completions and text completions formats (`transformTurboInstructResponse`).
2. **Newer API Formats:** For newer APIs like the Responses API, there's transformation logic (`transformResponsesApiResponse`) to convert responses to a format compatible with existing clients.
When adding support for new models or APIs, consider whether transformation is needed to maintain compatibility with existing clients.
## Key Management
### Key Pool System
The project uses a sophisticated key pool system (`src/shared/key-management/key-pool.ts`) to manage API keys for different providers. Key features include:
* **Key Selection:** The system selects the appropriate key based on model family, region preferences, and other criteria.
* **Rotation:** Keys are rotated to distribute usage and avoid hitting rate limits.
* **Health Checks:** Keys are checked periodically to ensure they're still valid and within rate limits.
### Provider-Specific Key Management
Each provider has its own key management module in `src/shared/key-management/`:
* **Key Checkers:** Each provider implements key checkers to validate keys and check their status.
* **Token Counters:** Providers implement token counting logic specific to their pricing model.
* **Models Support:** Keys are associated with specific model families they support.
When adding a new model or provider, you'll need to update or create the appropriate key management files.
### Key Rotation and Health Checks
The key pool system includes logic for:
* **Rotation Strategy:** Keys are selected based on a prioritization strategy (`prioritize-keys.ts`).
* **Disabling Unhealthy Keys:** Keys that fail health checks are temporarily disabled.
* **Rate Limit Awareness:** The system tracks usage to avoid hitting provider rate limits.
## Data Management
### Database (`src/shared/database/`)
* Likely uses Prisma or a similar ORM.
* Defines database schemas (e.g., for users, API keys, usage logs).
* Provides functions for interacting with the database.
* Configuration is managed in `src/config.ts`.
### File Storage (`src/shared/file-storage/`)
* May be used for storing logs, cached data, or user-uploaded files.
* Could integrate with local storage or cloud providers (e.g., S3, GCS).
## Authentication & Authorization
* **User Auth:** Handled in `src/user/` potentially using sessions (`src/shared/with-session.ts`) or JWTs.
* **Proxy Auth:** The `gatekeeper.ts` middleware likely verifies incoming requests to the proxy endpoints. This could involve checking:
* Custom API keys stored in the database (`src/shared/database/`).
* Specific tokens (`check-risu-token.ts`).
* HMAC signatures (`src/shared/hmac-signing.ts`).
* Origin checks (`check-origin.ts`).
* **Downstream Auth:** Each provider module (`src/proxy/*.ts`) handles authentication with the actual AI service API using keys from the configuration.
## Logging & Monitoring
* **Logging:** Configured in `src/logger.ts`, likely using a library like `pino` or `winston`. Logs requests, errors, and important events.
* **Prompt Logging:** Specific logic for logging prompts and responses might exist in `src/shared/prompt-logging/`.
* **Stats/Monitoring:** `src/shared/stats.ts` might handle collecting and exposing application metrics.
## Deployment
* **Docker:** The project likely includes Docker configuration for containerized deployment.
* **Render:** The `render.yaml` file suggests the project is or can be deployed on Render.
* **Environment Variables:** The `.env.example` file provides a template for required environment variables in production.
## Contributing
When contributing to this project:
1. **Follow Coding Standards:** Use the established patterns and standards in the codebase. The `.prettierrc` file defines code formatting rules.
2. **Update Documentation:** Keep this guide updated when adding new components or changing existing ones.
3. **Add Tests:** Ensure your changes are tested appropriately.
4. **Update Configuration:** If your changes require new environment variables, update `.env.example`.
*This guide provides a high-level overview. For detailed information, refer to the specific source code files.*
|