Add `qwen3.5-397b-a17b` model entries (NVIDIA and Ollama) and replace all hard-coded NVIDIA `api_base` URLs with `os.environ/NVIDIA_BASE_URL` in `litellm_config.yaml`.
Add `order`, `rpm`, `max_parallel_requests`, and `tags` fields to model configurations; extend `context_window_fallbacks` and `fallbacks` for planner, executor, and leaf-operator.
Add an NVIDIA-only `power` alias with an ordered frontier ladder, configure its models in `litellm_config.yaml`, and tighten retry limits (set `allowed_fails` to 1 and reduce timeout and rate-limit retries to 1 each).
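
For reviewers, the shape of these changes can be sketched roughly as the fragment below. It is an illustration under assumptions, not the actual diff: the provider-prefixed model IDs, the `NVIDIA_API_KEY` and `OLLAMA_BASE_URL` variable names, the numeric `rpm` and `max_parallel_requests` values, the tag names, the fallback targets, and the placement of the retry settings under `router_settings` are all placeholders. Only the field names, `os.environ/NVIDIA_BASE_URL`, and the retry values of 1 come from the summary above.

```yaml
# Illustrative fragment only; see the assumptions listed in the lead-in.
model_list:
  - model_name: power                            # NVIDIA-only alias
    litellm_params:
      model: nvidia_nim/qwen/qwen3.5-397b-a17b   # assumed model ID
      api_base: os.environ/NVIDIA_BASE_URL       # read from the environment, no hard-coded URL
      api_key: os.environ/NVIDIA_API_KEY         # assumed variable name
      order: 1                                   # first rung of the ordered ladder
      rpm: 40                                    # placeholder value
      max_parallel_requests: 4                   # placeholder value
      tags: ["nvidia", "frontier"]               # placeholder tags

  - model_name: planner
    litellm_params:
      model: ollama/qwen3.5:397b-a17b            # assumed Ollama model tag
      api_base: os.environ/OLLAMA_BASE_URL       # assumed variable name

router_settings:
  allowed_fails: 1                               # cool a deployment down after one failure
  retry_policy:
    TimeoutErrorRetries: 1
    RateLimitErrorRetries: 1
  fallbacks:
    - planner: ["power"]                         # placeholder fallback target
  context_window_fallbacks:
    - planner: ["power"]                         # placeholder fallback target
```

With `api_base` resolved from the environment, `NVIDIA_BASE_URL` has to be set before the proxy starts; an unset variable would leave the NVIDIA entries without an endpoint.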