Commit History

**Remove stale model entries and tidy configuration**
1ce6135

agrawal-prateek committed on

Add qwen3.5-397b-a17b model entries (NVIDIA and Ollama) and replace all hard-coded NVIDIA `api_base` URLs with `os.environ/NVIDIA_BASE_URL` in `litellm_config.yaml`.
df2661e

agrawal-prateek committed on

Add `order`, `rpm`, `max_parallel_requests`, and `tags` fields to model configurations; extend `context_window_fallbacks` and `fallbacks` for planner, executor, and leaf-operator.
b8332e3

agrawal-prateek committed on

Add `power` NVIDIA-only alias with an ordered frontier ladder, configure its models in `litellm_config.yaml`, and tighten retry limits (set `allowed_fails` to 1 and reduce timeout/rate-limit retries to 1).
f6a364f

agrawal-prateek committed on

Add `input_type: "passage"` to NVIDIA nv-embedcode-7b-v1 model entries in `litellm_config.yaml`.
4098a2d

agrawal-prateek committed on

Add `encoding_format: "float"` to NVIDIA nv-embedcode-7b-v1 model entries in `litellm_config.yaml`.
bddd47b

agrawal-prateek committed on

Add `encoding_format: "float"` to NVIDIA nv-embedcode-7b-v1 model entries in `litellm_config.yaml`.
bad0411

agrawal-prateek committed on

Add code embedding model (`nvidia/nv-embedcode-7b-v1`) configuration to `litellm_config.yaml`.
1b8cdb3

agrawal-prateek committed on

Add NVIDIA nv-embedcode-7b-v1 embedding model configuration to `litellm_config.yaml`.
54d6bca

agrawal-prateek committed on

Add Gemini rate-limits Markdown, Gemini models JSON, and NVIDIA model catalog Markdown files.
b690b81

agrawal-prateek committed on

Add extensive model configurations (chat, reasoning, coding, multimodal, size bands, etc.) to `litellm_config.yaml`.
36497ae

agrawal-prateek committed on

Add `database_url` to `litellm_config.yaml` and document admin panel requirements in README.
d99134f

agrawal-prateek committed on
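
For orientation, the fields named in the commits above might combine in `litellm_config.yaml` roughly as follows. This is a hedged sketch, not the repository's actual file: the alias `code-embed`, the `nvidia_nim/` provider prefix, the `NVIDIA_API_KEY` and `DATABASE_URL` variable names, the tag, and the numeric limits are all assumptions made for illustration. Only the field names, the model ID, `NVIDIA_BASE_URL`, and `allowed_fails: 1` come from the commit messages.

```yaml
# Illustrative excerpt only; names marked "assumed" do not appear in the log.
model_list:
  - model_name: code-embed                    # assumed alias
    litellm_params:
      model: nvidia_nim/nvidia/nv-embedcode-7b-v1   # provider prefix assumed
      api_base: os.environ/NVIDIA_BASE_URL    # read from the environment, not hard-coded
      api_key: os.environ/NVIDIA_API_KEY      # assumed variable name
      input_type: "passage"
      encoding_format: "float"
      rpm: 40                                 # assumed limit
      max_parallel_requests: 4                # assumed limit
      tags: ["embedding"]                     # assumed tag

router_settings:
  allowed_fails: 1                            # value stated in commit f6a364f

general_settings:
  database_url: os.environ/DATABASE_URL       # assumed variable name
```

Referencing `os.environ/...` keeps endpoint URLs and credentials out of version control, which is presumably the motivation for commit df2661e.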