Spaces:

voldemort6996
/

rl-bus-optimizer

Running

App Files Files Community

rl-bus-optimizer

Commit History

Fix: Add grade_task_6 and grade_task_7 functions, update all exports, update grade_all_tasks to handle 7 tasks

6a637d1

Running

voldemort6996 commited on about 2 hours ago

Fix: Revert grade_task functions to return float for OpenEnv validator compatibility (Dict return caused validation failure)

8887da7

voldemort6996 commited on about 2 hours ago

Fix: Expand environment stop limit from 12 to 50 to support task_6 (20 stops) and task_7 (25 stops)

13825cb

voldemort6996 commited on about 2 hours ago

Major enhancements: Statistical testing (scipy.stats), large-scale tasks (20/25 stops), OR-Tools/MPC baselines, README visualizations, score range fix (0.01-0.99)

3d67b1a

voldemort6996 commited on about 2 hours ago

Fix: Expose grader functions for OpenEnv validator detection

7263359

Pranav Chaudhari commited on about 19 hours ago

Critical fix: Score range validation - Change 0.0-1.0 to 0.01-0.99

442ff00

voldemort6996 commited on about 3 hours ago

Fix OpenEnv grader detection - Add all exports to tasks.py and grader.py

8f286e6

voldemort6996 commited on about 19 hours ago

fix: sync grader.py report labels with 5-task suite

b8e6a14

voldemort6996 commited on about 19 hours ago

fix: move info prints to stderr and use comma-separated rewards in [END] tag for validator compliance

30bf3bb

voldemort6996 commited on about 20 hours ago

fix: expand task suite to 5 tasks for maximum validation redundancy

61203a1

voldemort6996 commited on about 21 hours ago

fix: adopt explicit colon notation for python and grader fields in openenv.yaml

e0d3cb9

voldemort6996 commited on about 21 hours ago

fix: restore legacy map for task_11 and task_21 to prevent Gradio browser cache errors

400305a

voldemort6996 commited on about 22 hours ago

fix: explicitly embed 'python' and 'grader' bindings inline to bypass OpenEnv validator schema drops

94c1d9b

voldemort6996 commited on about 22 hours ago

fix: clean unused imports and adopt strict bot format

b2610e6

voldemort6996 commited on about 23 hours ago

fix: correct '30 tasks' string literals to '3 tasks' in grader.py explicitly

e231d19

voldemort6996 commited on about 24 hours ago

fix: refactor to 3 explicit static tasks to guarantee validator discovery

f46e34d

voldemort6996 commited on about 24 hours ago

chore: final pre-submission sync

b44a9c6

voldemort6996 commited on 1 day ago

fix: force UI difficulty mapping exactly at entrypoint

ba914d8

voldemort6996 commited on 1 day ago

chore: force trigger Hugging Face rebuild

3fb4224

voldemort6996 commited on 1 day ago

fix: restore backward compatibility for app.py task lookup

32403a9

voldemort6996 commited on 1 day ago

chore: implement strict compliance fixes for Phase 2 evaluation

2ba76c3

voldemort6996 commited on 1 day ago

feat: expand task suite to 30 tasks with graders for Phase 2 compliance

507cd2c

voldemort6996 commited on 1 day ago

fix: restore LLM proxy compliance - Reverts default mode to 'llm' - Adds support for API_KEY env var - Caps episodes to 1 to stay under 20min timeout

65a2882

voldemort6996 commited on 2 days ago

compliance: match literal HF_TOKEN initialization from portal checklist (no defaults)

1aca32b

voldemort6996 commited on 2 days ago

fix: align inference.py logging with trajectory sample script - Emits [STEP] for every environment step - Includes success threshold and reward list in [END] - Runtime: 0.13s

2c5a182

voldemort6996 commited on 2 days ago

fix: enforce strict [START]/[STEP]/[END] granular logging for Phase 2 compliance - Emits logs for each task (easy, medium, hard) - Aligns with mandatory field ordering and naming

3bd139e

voldemort6996 commited on 2 days ago

fix: apply_what_if queue bug - extend list instead of adding int to list

a4aaa93

voldemort6996 commited on 2 days ago

fix: default to DQN mode in inference.py to prevent 30min timeout - Switch default from llm to dqn (0.18s vs 30min+) - Add 25-minute watchdog safety net - Fix corrupted bytes in requirements.txt - LLM mode still available via --mode llm

126110b

voldemort6996 commited on 2 days ago

Compliance: Restore README frontmatter and finalize v1.2.0 features

ca3950c

voldemort6996 commited on 3 days ago

Compliance: Set 'openai/gpt-oss-120b:free' as default in inference.py

dbce134

voldemort6996 commited on 3 days ago

Premium: Add Streaming and Neural Reasoning Load (Reasoning Tokens) tracking

15b8082

voldemort6996 commited on 3 days ago

Resilience: Add Hugging Face Inference API as Tier-2 fallback

e25bd6c

voldemort6996 commited on 3 days ago

Reliability: Implement model rotation to bypass OpenRouter rate limits

354e5b9

voldemort6996 commited on 3 days ago

LLM: Upgrade to Strategic Strategy and improve Dashboard differentiation

f7bffb3

voldemort6996 commited on 3 days ago

FIX: Correct OpenRouter Model ID to google/gemma-3-27b-it:free

cf4a67c

voldemort6996 commited on 3 days ago

UI: Add detailed OpenRouter error diagnostics and required headers

3730947

voldemort6996 commited on 3 days ago

FIX: Ensure 'AUTORUN' respects LLM selection and add API Connectivity Tester

fb76b46

voldemort6996 commited on 3 days ago

UI: Add Live LLM Optimizer (OpenRouter) toggle and reasoning panel

477a062

voldemort6996 commited on 3 days ago

UI: Overhaul with Premium Apple-Style Metrics and Sidebar Layout

dfd3542

voldemort6996 commited on 3 days ago

Docs: Update README with new architecture and compliance results

6ef6372

voldemort6996 commited on 3 days ago

Compliance: Fully aligned project with OpenEnv requirements (API, logging, and structure)

9906627

voldemort6996 commited on 3 days ago

docs: add Live Demo link to README

0c86254

voldemort6996 commited on 4 days ago

feat: Dueling DDQN + PER, GTFS demand profiles, convergence analytics, premium UI

fb1c248

voldemort6996 commited on 4 days ago

feat: complete premium hackathon upgrades with DDQN, XAI, and Compare Mode

001e2b3

voldemort6996 commited on 11 days ago

fixed merge conflict

dab4c77

voldemort6996 commited on 11 days ago

Initial commit - Mini RL Bus Project

ef11b18

voldemort6996 commited on 11 days ago

Initial commit

417b4f0
unverified

voldemort6996 commited on 11 days ago

Commit History

Fix: Add grade_task_6 and grade_task_7 functions, update __all__ exports, update grade_all_tasks to handle 7 tasks 6a637d1 Running

Fix: Revert grade_task functions to return float for OpenEnv validator compatibility (Dict return caused validation failure) 8887da7

Fix: Expand environment stop limit from 12 to 50 to support task_6 (20 stops) and task_7 (25 stops) 13825cb

Major enhancements: Statistical testing (scipy.stats), large-scale tasks (20/25 stops), OR-Tools/MPC baselines, README visualizations, score range fix (0.01-0.99) 3d67b1a

Fix: Expose grader functions for OpenEnv validator detection 7263359

Critical fix: Score range validation - Change 0.0-1.0 to 0.01-0.99 442ff00

Fix OpenEnv grader detection - Add __all__ exports to tasks.py and grader.py 8f286e6

fix: sync grader.py report labels with 5-task suite b8e6a14

fix: move info prints to stderr and use comma-separated rewards in [END] tag for validator compliance 30bf3bb

fix: expand task suite to 5 tasks for maximum validation redundancy 61203a1

fix: adopt explicit colon notation for python and grader fields in openenv.yaml e0d3cb9

fix: restore legacy map for task_11 and task_21 to prevent Gradio browser cache errors 400305a

fix: explicitly embed 'python' and 'grader' bindings inline to bypass OpenEnv validator schema drops 94c1d9b

fix: clean unused imports and adopt strict bot format b2610e6

fix: correct '30 tasks' string literals to '3 tasks' in grader.py explicitly e231d19

fix: refactor to 3 explicit static tasks to guarantee validator discovery f46e34d

chore: final pre-submission sync b44a9c6

fix: force UI difficulty mapping exactly at entrypoint ba914d8

chore: force trigger Hugging Face rebuild 3fb4224

fix: restore backward compatibility for app.py task lookup 32403a9

chore: implement strict compliance fixes for Phase 2 evaluation 2ba76c3

feat: expand task suite to 30 tasks with graders for Phase 2 compliance 507cd2c

fix: restore LLM proxy compliance - Reverts default mode to 'llm' - Adds support for API_KEY env var - Caps episodes to 1 to stay under 20min timeout 65a2882

compliance: match literal HF_TOKEN initialization from portal checklist (no defaults) 1aca32b

fix: align inference.py logging with trajectory sample script - Emits [STEP] for every environment step - Includes success threshold and reward list in [END] - Runtime: 0.13s 2c5a182

fix: enforce strict [START]/[STEP]/[END] granular logging for Phase 2 compliance - Emits logs for each task (easy, medium, hard) - Aligns with mandatory field ordering and naming 3bd139e

fix: apply_what_if queue bug - extend list instead of adding int to list a4aaa93

fix: default to DQN mode in inference.py to prevent 30min timeout - Switch default from llm to dqn (0.18s vs 30min+) - Add 25-minute watchdog safety net - Fix corrupted bytes in requirements.txt - LLM mode still available via --mode llm 126110b

Compliance: Restore README frontmatter and finalize v1.2.0 features ca3950c

Compliance: Set 'openai/gpt-oss-120b:free' as default in inference.py dbce134

Premium: Add Streaming and Neural Reasoning Load (Reasoning Tokens) tracking 15b8082

Resilience: Add Hugging Face Inference API as Tier-2 fallback e25bd6c

Reliability: Implement model rotation to bypass OpenRouter rate limits 354e5b9

LLM: Upgrade to Strategic Strategy and improve Dashboard differentiation f7bffb3

FIX: Correct OpenRouter Model ID to google/gemma-3-27b-it:free cf4a67c

UI: Add detailed OpenRouter error diagnostics and required headers 3730947

FIX: Ensure 'AUTORUN' respects LLM selection and add API Connectivity Tester fb76b46

UI: Add Live LLM Optimizer (OpenRouter) toggle and reasoning panel 477a062

UI: Overhaul with Premium Apple-Style Metrics and Sidebar Layout dfd3542

Docs: Update README with new architecture and compliance results 6ef6372

Compliance: Fully aligned project with OpenEnv requirements (API, logging, and structure) 9906627

docs: add Live Demo link to README 0c86254

feat: Dueling DDQN + PER, GTFS demand profiles, convergence analytics, premium UI fb1c248

feat: complete premium hackathon upgrades with DDQN, XAI, and Compare Mode 001e2b3

fixed merge conflict dab4c77

Initial commit - Mini RL Bus Project ef11b18

Initial commit 417b4f0 unverified

Fix: Add grade_task_6 and grade_task_7 functions, update all exports, update grade_all_tasks to handle 7 tasks

6a637d1

Running

Fix: Revert grade_task functions to return float for OpenEnv validator compatibility (Dict return caused validation failure)

8887da7

Fix: Expand environment stop limit from 12 to 50 to support task_6 (20 stops) and task_7 (25 stops)

13825cb

Major enhancements: Statistical testing (scipy.stats), large-scale tasks (20/25 stops), OR-Tools/MPC baselines, README visualizations, score range fix (0.01-0.99)

3d67b1a

Fix: Expose grader functions for OpenEnv validator detection

7263359

Critical fix: Score range validation - Change 0.0-1.0 to 0.01-0.99

442ff00

Fix OpenEnv grader detection - Add all exports to tasks.py and grader.py

8f286e6

fix: sync grader.py report labels with 5-task suite

b8e6a14

fix: move info prints to stderr and use comma-separated rewards in [END] tag for validator compliance

30bf3bb

fix: expand task suite to 5 tasks for maximum validation redundancy

61203a1

fix: adopt explicit colon notation for python and grader fields in openenv.yaml

e0d3cb9

fix: restore legacy map for task_11 and task_21 to prevent Gradio browser cache errors

400305a

fix: explicitly embed 'python' and 'grader' bindings inline to bypass OpenEnv validator schema drops

94c1d9b

fix: clean unused imports and adopt strict bot format

b2610e6

fix: correct '30 tasks' string literals to '3 tasks' in grader.py explicitly

e231d19

fix: refactor to 3 explicit static tasks to guarantee validator discovery

f46e34d

chore: final pre-submission sync

b44a9c6

fix: force UI difficulty mapping exactly at entrypoint

ba914d8

chore: force trigger Hugging Face rebuild

3fb4224

fix: restore backward compatibility for app.py task lookup

32403a9

chore: implement strict compliance fixes for Phase 2 evaluation

2ba76c3

feat: expand task suite to 30 tasks with graders for Phase 2 compliance

507cd2c

fix: restore LLM proxy compliance - Reverts default mode to 'llm' - Adds support for API_KEY env var - Caps episodes to 1 to stay under 20min timeout

65a2882

compliance: match literal HF_TOKEN initialization from portal checklist (no defaults)

1aca32b

fix: align inference.py logging with trajectory sample script - Emits [STEP] for every environment step - Includes success threshold and reward list in [END] - Runtime: 0.13s

2c5a182

fix: enforce strict [START]/[STEP]/[END] granular logging for Phase 2 compliance - Emits logs for each task (easy, medium, hard) - Aligns with mandatory field ordering and naming

3bd139e

fix: apply_what_if queue bug - extend list instead of adding int to list

a4aaa93

fix: default to DQN mode in inference.py to prevent 30min timeout - Switch default from llm to dqn (0.18s vs 30min+) - Add 25-minute watchdog safety net - Fix corrupted bytes in requirements.txt - LLM mode still available via --mode llm

126110b

Compliance: Restore README frontmatter and finalize v1.2.0 features

ca3950c

Compliance: Set 'openai/gpt-oss-120b:free' as default in inference.py

dbce134

Premium: Add Streaming and Neural Reasoning Load (Reasoning Tokens) tracking

15b8082

Resilience: Add Hugging Face Inference API as Tier-2 fallback

e25bd6c

Reliability: Implement model rotation to bypass OpenRouter rate limits

354e5b9

LLM: Upgrade to Strategic Strategy and improve Dashboard differentiation

f7bffb3

FIX: Correct OpenRouter Model ID to google/gemma-3-27b-it:free

cf4a67c

UI: Add detailed OpenRouter error diagnostics and required headers

3730947

FIX: Ensure 'AUTORUN' respects LLM selection and add API Connectivity Tester

fb76b46

UI: Add Live LLM Optimizer (OpenRouter) toggle and reasoning panel

477a062

UI: Overhaul with Premium Apple-Style Metrics and Sidebar Layout

dfd3542

Docs: Update README with new architecture and compliance results

6ef6372

Compliance: Fully aligned project with OpenEnv requirements (API, logging, and structure)

9906627

docs: add Live Demo link to README

0c86254

feat: Dueling DDQN + PER, GTFS demand profiles, convergence analytics, premium UI

fb1c248

feat: complete premium hackathon upgrades with DDQN, XAI, and Compare Mode

001e2b3

fixed merge conflict

dab4c77

Initial commit - Mini RL Bus Project

ef11b18

Initial commit

417b4f0
unverified