Commit History

Fix: Add grade_task_6 and grade_task_7 functions, update __all__ exports, update grade_all_tasks to handle 7 tasks
6a637d1
Running

voldemort6996 commited on

Fix: Revert grade_task functions to return float for OpenEnv validator compatibility (Dict return caused validation failure)
8887da7

voldemort6996 commited on

Fix: Expand environment stop limit from 12 to 50 to support task_6 (20 stops) and task_7 (25 stops)
13825cb

voldemort6996 commited on

Major enhancements: Statistical testing (scipy.stats), large-scale tasks (20/25 stops), OR-Tools/MPC baselines, README visualizations, score range fix (0.01-0.99)
3d67b1a

voldemort6996 commited on

Fix: Expose grader functions for OpenEnv validator detection
7263359

Pranav Chaudhari commited on

Critical fix: Score range validation - Change 0.0-1.0 to 0.01-0.99
442ff00

voldemort6996 commited on

Fix OpenEnv grader detection - Add __all__ exports to tasks.py and grader.py
8f286e6

voldemort6996 commited on

fix: sync grader.py report labels with 5-task suite
b8e6a14

voldemort6996 commited on

fix: move info prints to stderr and use comma-separated rewards in [END] tag for validator compliance
30bf3bb

voldemort6996 commited on

fix: expand task suite to 5 tasks for maximum validation redundancy
61203a1

voldemort6996 commited on

fix: adopt explicit colon notation for python and grader fields in openenv.yaml
e0d3cb9

voldemort6996 commited on

fix: restore legacy map for task_11 and task_21 to prevent Gradio browser cache errors
400305a

voldemort6996 commited on

fix: explicitly embed 'python' and 'grader' bindings inline to bypass OpenEnv validator schema drops
94c1d9b

voldemort6996 commited on

fix: clean unused imports and adopt strict bot format
b2610e6

voldemort6996 commited on

fix: correct '30 tasks' string literals to '3 tasks' in grader.py explicitly
e231d19

voldemort6996 commited on

fix: refactor to 3 explicit static tasks to guarantee validator discovery
f46e34d

voldemort6996 commited on

chore: final pre-submission sync
b44a9c6

voldemort6996 commited on

fix: force UI difficulty mapping exactly at entrypoint
ba914d8

voldemort6996 commited on

chore: force trigger Hugging Face rebuild
3fb4224

voldemort6996 commited on

fix: restore backward compatibility for app.py task lookup
32403a9

voldemort6996 commited on

chore: implement strict compliance fixes for Phase 2 evaluation
2ba76c3

voldemort6996 commited on

feat: expand task suite to 30 tasks with graders for Phase 2 compliance
507cd2c

voldemort6996 commited on

fix: restore LLM proxy compliance - Reverts default mode to 'llm' - Adds support for API_KEY env var - Caps episodes to 1 to stay under 20min timeout
65a2882

voldemort6996 commited on

compliance: match literal HF_TOKEN initialization from portal checklist (no defaults)
1aca32b

voldemort6996 commited on

fix: align inference.py logging with trajectory sample script - Emits [STEP] for every environment step - Includes success threshold and reward list in [END] - Runtime: 0.13s
2c5a182

voldemort6996 commited on

fix: enforce strict [START]/[STEP]/[END] granular logging for Phase 2 compliance - Emits logs for each task (easy, medium, hard) - Aligns with mandatory field ordering and naming
3bd139e

voldemort6996 commited on

fix: apply_what_if queue bug - extend list instead of adding int to list
a4aaa93

voldemort6996 commited on

fix: default to DQN mode in inference.py to prevent 30min timeout - Switch default from llm to dqn (0.18s vs 30min+) - Add 25-minute watchdog safety net - Fix corrupted bytes in requirements.txt - LLM mode still available via --mode llm
126110b

voldemort6996 commited on

Compliance: Restore README frontmatter and finalize v1.2.0 features
ca3950c

voldemort6996 commited on

Compliance: Set 'openai/gpt-oss-120b:free' as default in inference.py
dbce134

voldemort6996 commited on

Premium: Add Streaming and Neural Reasoning Load (Reasoning Tokens) tracking
15b8082

voldemort6996 commited on

Resilience: Add Hugging Face Inference API as Tier-2 fallback
e25bd6c

voldemort6996 commited on

Reliability: Implement model rotation to bypass OpenRouter rate limits
354e5b9

voldemort6996 commited on

LLM: Upgrade to Strategic Strategy and improve Dashboard differentiation
f7bffb3

voldemort6996 commited on

FIX: Correct OpenRouter Model ID to google/gemma-3-27b-it:free
cf4a67c

voldemort6996 commited on

UI: Add detailed OpenRouter error diagnostics and required headers
3730947

voldemort6996 commited on

FIX: Ensure 'AUTORUN' respects LLM selection and add API Connectivity Tester
fb76b46

voldemort6996 commited on

UI: Add Live LLM Optimizer (OpenRouter) toggle and reasoning panel
477a062

voldemort6996 commited on

UI: Overhaul with Premium Apple-Style Metrics and Sidebar Layout
dfd3542

voldemort6996 commited on

Docs: Update README with new architecture and compliance results
6ef6372

voldemort6996 commited on

Compliance: Fully aligned project with OpenEnv requirements (API, logging, and structure)
9906627

voldemort6996 commited on

docs: add Live Demo link to README
0c86254

voldemort6996 commited on

feat: Dueling DDQN + PER, GTFS demand profiles, convergence analytics, premium UI
fb1c248

voldemort6996 commited on

feat: complete premium hackathon upgrades with DDQN, XAI, and Compare Mode
001e2b3

voldemort6996 commited on

fixed merge conflict
dab4c77

voldemort6996 commited on

Initial commit - Mini RL Bus Project
ef11b18

voldemort6996 commited on

Initial commit
417b4f0
unverified

voldemort6996 commited on