Spaces:

agrawal-prateek
/

llm-server

Runtime error

App Files Files Community

llm-server

Commit History

Remove stale model entries and tidy configuration

1ce6135

agrawal-prateek commited on Apr 4

Add qwen3.5‑397b‑a17b model entries (NVIDIA and Ollama) and replace all hard‑coded NVIDIA `api_base` URLs with `os.environ/NVIDIA_BASE_URL` in `litellm_config.yaml`.

df2661e

agrawal-prateek commited on Apr 3

Add `order`, `rpm`, `max_parallel_requests`, and `tags` fields to model configurations; extend `context_window_fallbacks` and `fallbacks` for planner, executor, and leaf‑operator.

b8332e3

agrawal-prateek commited on Apr 3

Add `power` NVIDIA‑only alias with ordered frontier ladder, configure its models in `litellm_config.yaml`, and tighten retry limits (set `allowed_fails` to 1 and reduce timeout/rate‑limit retries to 1).

f6a364f

agrawal-prateek commited on Apr 1

Add `input_type: "passage"` to NVIDIA nv‑embedcode‑7b‑v1 model entries in litellm_config.yaml.

4098a2d

agrawal-prateek commited on Apr 1

Add `encoding_format: "float"` to NVIDIA nv‑embedcode‑7b‑v1 model entries in litellm_config.yaml.

bddd47b

agrawal-prateek commited on Apr 1

Add `encoding_format: "float"` to NVIDIA nv‑embedcode‑7b‑v1 model entries in litellm_config.yaml.

bad0411

agrawal-prateek commited on Apr 1

Add code embedding model (nvidia/nv-embedcode-7b-v1) configuration to litellm_config.yaml.

1b8cdb3

agrawal-prateek commited on Apr 1

Add NVIDIA nv‑embedcode‑7b‑v1 embedding model configuration to litellm_config.yaml.

54d6bca

agrawal-prateek commited on Apr 1

Add Gemini rate‑limits markdown, Gemini models JSON, and NVIDIA model catalog markdown files.

b690b81

agrawal-prateek commited on Apr 1

Add extensive model configurations (chat, reasoning, coding, multimodal, size bands, etc.) to litellm_config.yaml.

36497ae

agrawal-prateek commited on Mar 31

```markdown

2adee7a

agrawal-prateek commited on Mar 31

Add `database_url` to `litellm_config.yaml` and document admin panel requirements in README.

d99134f

agrawal-prateek commited on Mar 31

initial commit

951381f

agrawal-prateek commited on Mar 30

Commit History

**Remove stale model entries and tidy configuration** 1ce6135

Add qwen3.5‑397b‑a17b model entries (NVIDIA and Ollama) and replace all hard‑coded NVIDIA `api_base` URLs with `os.environ/NVIDIA_BASE_URL` in `litellm_config.yaml`. df2661e

Add `order`, `rpm`, `max_parallel_requests`, and `tags` fields to model configurations; extend `context_window_fallbacks` and `fallbacks` for planner, executor, and leaf‑operator. b8332e3

Add `power` NVIDIA‑only alias with ordered frontier ladder, configure its models in `litellm_config.yaml`, and tighten retry limits (set `allowed_fails` to 1 and reduce timeout/rate‑limit retries to 1). f6a364f

Add `input_type: "passage"` to NVIDIA nv‑embedcode‑7b‑v1 model entries in litellm_config.yaml. 4098a2d

Add `encoding_format: "float"` to NVIDIA nv‑embedcode‑7b‑v1 model entries in litellm_config.yaml. bddd47b

Add `encoding_format: "float"` to NVIDIA nv‑embedcode‑7b‑v1 model entries in litellm_config.yaml. bad0411

Add code embedding model (nvidia/nv-embedcode-7b-v1) configuration to litellm_config.yaml. 1b8cdb3

Add NVIDIA nv‑embedcode‑7b‑v1 embedding model configuration to litellm_config.yaml. 54d6bca

Add Gemini rate‑limits markdown, Gemini models JSON, and NVIDIA model catalog markdown files. b690b81

Add extensive model configurations (chat, reasoning, coding, multimodal, size bands, etc.) to litellm_config.yaml. 36497ae

```markdown 2adee7a

Add `database_url` to `litellm_config.yaml` and document admin panel requirements in README. d99134f

initial commit 951381f

Remove stale model entries and tidy configuration

1ce6135

Add qwen3.5‑397b‑a17b model entries (NVIDIA and Ollama) and replace all hard‑coded NVIDIA `api_base` URLs with `os.environ/NVIDIA_BASE_URL` in `litellm_config.yaml`.

df2661e

Add `order`, `rpm`, `max_parallel_requests`, and `tags` fields to model configurations; extend `context_window_fallbacks` and `fallbacks` for planner, executor, and leaf‑operator.

b8332e3

Add `power` NVIDIA‑only alias with ordered frontier ladder, configure its models in `litellm_config.yaml`, and tighten retry limits (set `allowed_fails` to 1 and reduce timeout/rate‑limit retries to 1).

f6a364f

Add `input_type: "passage"` to NVIDIA nv‑embedcode‑7b‑v1 model entries in litellm_config.yaml.

4098a2d

Add `encoding_format: "float"` to NVIDIA nv‑embedcode‑7b‑v1 model entries in litellm_config.yaml.

bddd47b

Add `encoding_format: "float"` to NVIDIA nv‑embedcode‑7b‑v1 model entries in litellm_config.yaml.

bad0411

Add code embedding model (nvidia/nv-embedcode-7b-v1) configuration to litellm_config.yaml.

1b8cdb3

Add NVIDIA nv‑embedcode‑7b‑v1 embedding model configuration to litellm_config.yaml.

54d6bca

Add Gemini rate‑limits markdown, Gemini models JSON, and NVIDIA model catalog markdown files.

b690b81

Add extensive model configurations (chat, reasoning, coding, multimodal, size bands, etc.) to litellm_config.yaml.

36497ae

```markdown

2adee7a

Add `database_url` to `litellm_config.yaml` and document admin panel requirements in README.

d99134f

initial commit

951381f