Spaces:

elmerzole
/

llm-api-proxy

Paused

App Files Files Community

llm-api-proxy / Deployment guide.md

Mirrowel

fix: guide rename

5b211f4 7 months ago

preview code

raw

history blame

7.61 kB

Easy Guide to Deploying LLM-API-Key-Proxy on Render

This guide walks you through deploying the LLM-API-Key-Proxy as a hosted service on Render.com. The project provides a universal, OpenAI-compatible API endpoint for all your LLM providers (like Gemini or OpenAI), powered by an intelligent key management library. It's perfect for integrating with platforms like JanitorAI, where you can use it as a custom proxy for highly available and resilient chats.

The process is beginner-friendly and takes about 15-30 minutes. We'll use Render's free tier (with limitations like sleep after 15 minutes of inactivity) and upload your .env file as a secret for easy key management—no manual entry of variables required.

Prerequisites

A free Render.com account (sign up at render.com).
A GitHub account (for forking the repo).
Basic terminal access (e.g., Command Prompt, Terminal, or Git Bash).
API keys from LLM providers (e.g., Gemini, OpenAI—get them from their dashboards). For details on supported providers and how to format their keys (e.g., API key naming conventions), refer to the LiteLLM Providers Documentation.

Note: You don't need Python installed for initial testing—use the pre-compiled Windows EXE from the repo's releases for a quick local trial.

Step 1: Test Locally with the Compiled EXE (No Python Required)

Before deploying, try the proxy locally to ensure your keys work. This uses a pre-built executable that's easy to set up.

Go to the repo's GitHub Releases page.
Download the latest release ZIP file (e.g., for Windows).
Unzip the file.
Double-click setup_env.bat. A window will open—follow the prompts to add your PROXY_API_KEY (a strong secret you create) and provider keys. Use the LiteLLM Providers Documentation for guidance on key formats (e.g., GEMINI_API_KEY_1="your-key").
Double-click proxy_app.exe to start the proxy. It runs at http://127.0.0.1:8000—visit in a browser to confirm "API Key Proxy is running".
Test with curl (replace with your PROXY_API_KEY):

curl -X POST http://127.0.0.1:8000/v1/chat/completions -H "Content-Type: application/json" -H "Authorization: Bearer your-proxy-key" -d '{"model": "gemini/gemini-2.5-flash", "messages": [{"role": "user", "content": "What is the capital of France?"}]}'

- Expected: A JSON response with the answer (e.g., "Paris").

If it works, you're ready to deploy. If not, double-check your keys against LiteLLM docs.

Step 2: Fork and Prepare the Repository

Go to the original repo: https://github.com/Mirrowel/LLM-API-Key-Proxy.
Click Fork in the top-right to create your own copy (this lets you make changes if needed).
Clone your forked repo locally:

git clone https://github.com/YOUR-USERNAME/LLM-API-Key-Proxy.git
cd LLM-API-Key-Proxy

Step 3: Assemble Your .env File

The proxy uses a .env file to store your API keys securely. We'll create this based on the repo's documentation.

In your cloned repo, copy the example: copy .env.example .env (Windows) or cp .env.example .env (macOS/Linux).
Open .env in a text editor (e.g., Notepad or VS Code).
Add your keys following the format from the repo's README and LiteLLM Providers Documentation:
- PROXY_API_KEY: Create a strong, unique secret (e.g., "my-super-secret-proxy-key"). This authenticates requests to your proxy.
- Provider Keys: Add keys for your chosen providers. You can add multiple per provider (e.g., _1, _2) for rotation.

Example .env (customize with your real keys):

# Your proxy's authentication key (invent a strong one)
PROXY_API_KEY="my-super-secret-proxy-key"

# Provider API keys (get from provider dashboards; see LiteLLM docs for formats)
GEMINI_API_KEY_1="your-gemini-key-here"
GEMINI_API_KEY_2="another-gemini-key"

OPENROUTER_API_KEY_1="your-openrouter-key"

- Supported providers: Check LiteLLM docs for a full list and specifics (e.g., GEMINI, OPENROUTER, NVIDIA_NIM).
- Tip: Start with 1-2 providers to test. Don't share this file publicly!

Save the file. (We'll upload it to Render in Step 5.)

Step 4: Create a New Web Service on Render

Log in to render.com and go to your Dashboard.
Click New > Web Service.
Choose Build and deploy from a Git repository > Next.
Connect your GitHub account and select your forked repo.
In the setup form:
- Name: Something like "llm-api-key-proxy".
- Region: Choose one close to you (e.g., Oregon for US West).
- Branch: "main" (or your default).
- Runtime: Python 3.
- Build Command: pip install -r requirements.txt.
- Start Command: uvicorn src.proxy_app.main:app --host 0.0.0.0 --port $PORT.
- Instance Type: Free (for testing; upgrade later for always-on service).
Click Create Web Service. Render will build and deploy—watch the progress in the Events tab.

Step 5: Upload .env as a Secret File

Render mounts secret files securely at runtime, making your .env available to the app without exposing it.

In your new service's Dashboard, go to Environment > Secret Files.
Click Add Secret File.
File Path: Don't change. Keep it as root directory of the repo.
Contents: Upload the .env file you created previously.
Save. This injects the file for the app to load via dotenv (already in the code).
Trigger a redeploy: Go to Deploy > Manual Deploy > Deploy HEAD (or push a small change to your repo).

Your keys are now loaded automatically!

Step 6: Test Your Deployed Proxy

Note your service URL: It's in the Dashboard (e.g., https://llm-api-key-proxy.onrender.com).
Test with curl (replace with your PROXY_API_KEY):

curl -X POST https://your-service.onrender.com/v1/chat/completions -H "Content-Type: application/json" -H "Authorization: Bearer your-proxy-key" -d '{"model": "gemini/gemini-2.5-flash", "messages": [{"role": "user", "content": "What is the capital of France?"}]}'

- Expected: A JSON response with the answer (e.g., "Paris").

Check logs in Render's Dashboard for startup messages (e.g., "RotatingClient initialized").

Step 7: Integrate with JanitorAI

Log in to janitorai.com and go to API settings (usually in a chat or account menu).
Select "Proxy" mode.
API URL: https://your-service.onrender.com/v1.
API Key: Your PROXY_API_KEY (from .env).
Model: Format as "provider/model" (e.g., "gemini/gemini-2.5-flash"; check LiteLLM docs for options).
Save and test a chat—messages should route through your proxy.

Troubleshooting

Build Fails: Check Render logs for missing dependencies—ensure requirements.txt is up to date.
401 Unauthorized: Verify your PROXY_API_KEY matches exactly (case-sensitive) and includes "Bearer " in requests. Or you have no keys for the provider/model added that you are trying to use.
405 on OPTIONS: If CORS issues arise, add the middleware from Step 3 and redeploy.
Service Sleeps: Free tier sleeps after inactivity—first requests may delay.
Provider Key Issues: Double-check formats in LiteLLM Providers Documentation.
More Help: Check Render docs or the repo's README. If stuck, share error logs.

That is it.