Spaces:

plarnholt
/

excom-ai-demo

Paused

Peter Larnholt commited on Oct 9

Commit

2e9c870

1 Parent(s): 3356350

Fix invalid --disable-guided-decoding flag and add airportsdata dependency

The --disable-guided-decoding flag doesn't exist in vLLM 0.6.3.post1.
Instead, ensure outlines backend works properly by adding airportsdata
dependency which is required for guided decoding imports.

Files changed (2) hide show

app.py +0 -1
requirements.txt +3 -0

app.py CHANGED Viewed

@@ -27,7 +27,6 @@ VLLM_ARGS = [
     "--gpu-memory-utilization", "0.90",
     "--trust-remote-code",
     "--disable-log-requests",                # reduce log noise
-    "--disable-guided-decoding",             # skip guided decoding (outlines) to avoid import issues
 ]
 if "AWQ" in MODEL_ID.upper():
     VLLM_ARGS += ["--quantization", "awq_marlin"]  # faster AWQ kernel if available

     "--gpu-memory-utilization", "0.90",
     "--trust-remote-code",
     "--disable-log-requests",                # reduce log noise
 ]
 if "AWQ" in MODEL_ID.upper():
     VLLM_ARGS += ["--quantization", "awq_marlin"]  # faster AWQ kernel if available

requirements.txt CHANGED Viewed

@@ -9,3 +9,6 @@ vllm==0.6.3.post1
 torch==2.4.0
 transformers>=4.44
 accelerate>=0.30

 torch==2.4.0
 transformers>=4.44
 accelerate>=0.30
+# Required for vLLM's outlines guided decoding backend
+airportsdata>=20240400