rl-bus-optimizer / inference.py

Commit History

fix: move info prints to stderr and use comma-separated rewards in [END] tag for validator compliance
30bf3bb

voldemort6996 commited on

fix: expand task suite to 5 tasks for maximum validation redundancy
61203a1

voldemort6996 commited on

fix: refactor to 3 explicit static tasks to guarantee validator discovery
f46e34d

voldemort6996 commited on

chore: implement strict compliance fixes for Phase 2 evaluation
2ba76c3

voldemort6996 commited on

fix: restore LLM proxy compliance - Reverts default mode to 'llm' - Adds support for API_KEY env var - Caps episodes to 1 to stay under 20min timeout
65a2882

voldemort6996 commited on

compliance: match literal HF_TOKEN initialization from portal checklist (no defaults)
1aca32b

voldemort6996 commited on

fix: align inference.py logging with trajectory sample script - Emits [STEP] for every environment step - Includes success threshold and reward list in [END] - Runtime: 0.13s
2c5a182

voldemort6996 commited on

fix: enforce strict [START]/[STEP]/[END] granular logging for Phase 2 compliance - Emits logs for each task (easy, medium, hard) - Aligns with mandatory field ordering and naming
3bd139e

voldemort6996 commited on

fix: default to DQN mode in inference.py to prevent 30min timeout - Switch default from llm to dqn (0.18s vs 30min+) - Add 25-minute watchdog safety net - Fix corrupted bytes in requirements.txt - LLM mode still available via --mode llm
126110b

voldemort6996 commited on

Compliance: Set 'openai/gpt-oss-120b:free' as default in inference.py
dbce134

voldemort6996 commited on

FIX: Correct OpenRouter Model ID to google/gemma-3-27b-it:free
cf4a67c

voldemort6996 commited on

Compliance: Fully aligned project with OpenEnv requirements (API, logging, and structure)
9906627

voldemort6996 commited on

feat: Dueling DDQN + PER, GTFS demand profiles, convergence analytics, premium UI
fb1c248

voldemort6996 commited on

feat: complete premium hackathon upgrades with DDQN, XAI, and Compare Mode
001e2b3

voldemort6996 commited on