Document small model benefits and secrets

#1
by asusevski - opened
Files changed (1) hide show
  1. README.md +8 -1
README.md CHANGED
@@ -38,4 +38,11 @@ Dota Draft is an AI-assisted drafting workbench for Dota 2 Captains Mode. It let
38
 
39
  The app is a Gradio Space. The draft UI is implemented in `app.py`, the recommendation agent lives in `agent.py`, and the OpenDota MCP connection is defined in `mcp.py`.
40
 
41
- The default model is `Qwen/Qwen3.6-27B:featherless-ai`, which is under the Build Small 32B parameter cap. Runtime configuration is provided through environment variables such as `OPENAI_API_KEY`, `OPENAI_MODEL`, and `OPENDOTA_API_KEY`.
 
 
 
 
 
 
 
 
38
 
39
  The app is a Gradio Space. The draft UI is implemented in `app.py`, the recommendation agent lives in `agent.py`, and the OpenDota MCP connection is defined in `mcp.py`.
40
 
41
+ The default model is `Qwen/Qwen3.6-27B:featherless-ai`, which is under the Build Small 32B parameter cap. A small language model is a good fit for this draft assistant because the task is narrow and highly structured: the model does not need broad open-ended world knowledge, it needs to rank a constrained hero pool against a compact draft state and the latest OpenDota context. Keeping the model small reduces latency, cost, and infrastructure requirements, which makes the app easier to run on a public CPU Space while still leaving the strategic reasoning in an inspectable prompt-and-tool loop.
42
+
43
+ Runtime configuration is provided through Space Secrets and environment variables:
44
+
45
+ - `OPENAI_API_KEY`: Hugging Face token used by the OpenAI-compatible Hugging Face Router.
46
+ - `HF_TOKEN`: Hugging Face token available to Hub-aware libraries if needed.
47
+ - `OPENAI_MODEL`: optional model override.
48
+ - `OPENDOTA_API_KEY`: optional OpenDota API key for higher-rate OpenDota requests.