RoyAalekh committed on
Commit 4407d61 · 1 Parent(s): dcc70a3

Add simplified RL exploration plan for hackathon

Based on a critical expert review, this plan takes a simplified tabular Q-learning approach:
- Priority scoring only (not full scheduling replacement)
- 3-day implementation timeline
- Explainable decisions for judges
- Rule-based safety constraints
- Fast training (minutes, not hours)
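
For illustration only, the points above could be sketched roughly as follows. This is a minimal sketch, not the contents of RL_EXPLORATION_PLAN.md: the state layout, the three priority levels, the urgency rule, and every function name here are assumptions made up for the example.

```python
import random
from collections import defaultdict

# Hypothetical action set: the agent only picks a priority score
# (0 = low, 1 = medium, 2 = high); it does not build the schedule itself.
ACTIONS = (0, 1, 2)


def allowed_actions(state):
    """Assumed rule-based safety constraint: an urgent item never gets low priority.

    `state` is a tuple whose first element is a hypothetical urgency flag.
    """
    is_urgent = state[0]
    return (1, 2) if is_urgent else ACTIONS


def choose_action(q, state, epsilon, rng):
    """Epsilon-greedy choice restricted to the actions the rules allow."""
    actions = allowed_actions(state)
    if rng.random() < epsilon:
        return rng.choice(actions)
    return max(actions, key=lambda a: q[(state, a)])


def q_update(q, state, action, reward, next_state, alpha=0.1, gamma=0.9):
    """Standard tabular Q-learning update; returns the new Q-value."""
    best_next = max(q[(next_state, a)] for a in allowed_actions(next_state))
    td_target = reward + gamma * best_next
    q[(state, action)] += alpha * (td_target - q[(state, action)])
    return q[(state, action)]


# The whole model is one small table keyed by (state, action).
q_table = defaultdict(float)
rng = random.Random(0)

state, next_state = (True,), (False,)
action = choose_action(q_table, state, epsilon=0.1, rng=rng)
q_update(q_table, state, action, reward=1.0, next_state=next_state)
```

Because the learned values live in a plain lookup table, a decision can be explained by reading off the Q-values for a given state, and the safety rule is enforced outside the learned part, so it holds regardless of how training goes.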

Files changed (1)
  1. RL_EXPLORATION_PLAN.md +0 -0
RL_EXPLORATION_PLAN.md ADDED
Binary file (14.9 kB).