Commit History

Update NVIDIA NIM models - keep top 5, add minimax-m2.7
ef22b95
Running

Yash030 commited on

Fix Silicon Flow model IDs with correct API names
24b9325

Yash030 commited on

Keep only 5 best Silicon Flow models for free tier
5bba595

Yash030 commited on

Add high-rate-limit Silicon Flow models for free tier usage
c9c8b95

Yash030 commited on

Better handle rate limit errors that come as 413 or BadRequestError.
c66ebaa

Yash030 Claude Opus 4.7 commited on

Add detailed error mapping debug logs.
860ed88

Yash030 Claude Opus 4.7 commited on

Add error details to APIError messages.
2492112

Yash030 Claude Opus 4.7 commited on

Add routing debug log to trace provider resolution.
65739aa

Yash030 Claude Opus 4.7 commited on

Add detailed error info to auth failure messages.
979bf9b

Yash030 Claude Opus 4.7 commited on

Add credential logging at provider creation.
a1a14b2

Yash030 Claude Opus 4.7 commited on

Add detailed error logging to trace API failures.
ce65a74

Yash030 Claude Opus 4.7 commited on

Add Cerebras, Silicon Flow, and Groq providers with debug logging.
db83b53

Yash030 Claude Opus 4.7 commited on

$(cat <<EOF
332dd16

Yash030 commited on

$(cat <<EOF
98fdd46

Yash030 commited on

$(cat <<EOF
55f294b

Yash030 commited on

$(cat <<EOF
0ba585f

Yash030 commited on

$(cat <<EOF
58a3721

Yash030 commited on

Add Cerebras and Silicon Flow models to REQUESTED_PROVIDER_MODELS
0223890

Yash030 Claude Opus 4.7 commited on

Add Z.ai GLM 4.7 model to Cerebras provider
6993cbc

Yash030 Claude Opus 4.7 commited on

Add Cerebras and Silicon Flow provider support
43ea069

Yash030 Claude Opus 4.7 commited on

Extend session visibility in admin dashboard with configurable retention
6339a53

Yash030 Claude Opus 4.7 commited on

Implement image support in proxy with vision-aware routing
574e4e7

Yash030 Claude Opus 4.7 commited on

NIM speed optimization — adaptive rate limiting and increased throughput
aa9c0b0

Yash030 Claude Opus 4.7 commited on

Fix stale sessions in admin dashboard and improve auto-routing health
188ffa9

Yash030 Claude Opus 4.7 commited on

docs: update CLAUDE.md with auto-routing optimizations
84a115b

Yash030 Claude Opus 4.7 commited on

Optimize auto routing: Zen unlimited, smarter fallback skipping
9358a6f

Yash030 Claude Opus 4.7 commited on

Track sessions via X-Session-ID header for accurate admin dashboard
948c8f9

Yash030 Claude Opus 4.7 commited on

docs: update CLAUDE.md with model list and health tracking
d64f2a2

Yash030 Claude Opus 4.7 commited on

Add free Zen models: big-pickle, ring-2.6-1t-free, nemotron-3-super-free
04fcbd7

Yash030 Claude Opus 4.7 commited on

Speed up NIM provider with failure tracking and faster timeouts
a5ea640

Yash030 Claude Opus 4.7 commited on

docs: complete README refactor with cloud deploy guide
fcc5278

Yash030 Claude Opus 4.7 commited on

Speed optimizations and enhanced auto-model fallback routing
ebba9d6

Yash030 Claude Opus 4.7 commited on

Performance optimizations for faster proxy routing
49813da

Yash030 Claude Opus 4.7 commited on

Display admin dashboard on root page
1985e64

Yash030 Claude Opus 4.7 commited on

Upgrade admin dashboard to terminal-style dark theme
a47c2e0

Yash030 Claude Opus 4.7 commited on

Track sessions by gateway client IP
f56589d

Yash030 Claude Opus 4.7 commited on

Fix: use sync version of track_request to avoid SyntaxError
b5bd2a8

Yash030 Claude Opus 4.7 commited on

Fix: sessions tracking now works correctly in dashboard and JSON
ef123a8

Yash030 Claude Opus 4.7 commited on

Add templates directory to Dockerfile for admin dashboard
89ba257

Yash030 Claude Opus 4.7 commited on

Update CLAUDE.md with admin dashboard docs
0015069

Yash030 Claude Opus 4.7 commited on

Add jinja2 explicit dependency for admin dashboard templates
f69902b

Yash030 Claude Opus 4.7 commited on

Add admin dashboard for session monitoring
f3220aa

Yash030 Claude Opus 4.7 commited on

Add git origins table to README
28e4b90

Yash030 commited on

Add smart task-aware routing (Phase 1)
4974012

Yash030 commited on

Fix httpx keepalive_expiry parameter name
8238c16

Yash030 commited on

Performance optimizations for proxy speed and shared sessions
cc3287d

Yash030 Claude Opus 4.7 commited on

Remove groq/cerebras, add zen model
d6a1875

Yash030 commited on

Remove groq from provider catalog
af4fba5

Yash030 commited on

Remove readme reference from pyproject.toml
981da17

Yash030 commited on

Add HF Space metadata
7745218

Yash030 commited on