---
language: en
tags:
- deep-q-network
- reinforcement-learning
- pathfinding
- hospital-floorplan
license: apache-2.0
datasets:
- custom
metrics:
- average_reward
- success_rate
---

# Deep Q-Network for Hospital Floorplan Navigation

## Model Description

This model is a Deep Q-Network (DQN) trained to find an efficient path through a hospital floorplan for wheeling a bed while avoiding obstacles. Training combines a traditional pathfinding algorithm (A*) with reinforcement learning, as described under Training below.

## Model Architecture

The model is a fully connected neural network with the following architecture (a minimal sketch follows the list):
- Input Layer: Flattened grid representation of the floorplan
- Hidden Layers: Two hidden layers with 64 units each and ReLU activation
- Output Layer: Four units representing the possible actions (up, down, left, right)
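
For concreteness, the snippet below is an equivalent `nn.Sequential` sketch of that stack, assuming the flattened 10x10 grid (100 inputs) used in the Usage example; the full class definition appears under Usage.

```python
import torch.nn as nn

# Equivalent layer stack, assuming a flattened 10x10 grid (100 inputs)
net = nn.Sequential(
    nn.Linear(100, 64), nn.ReLU(),  # hidden layer 1
    nn.Linear(64, 64), nn.ReLU(),   # hidden layer 2
    nn.Linear(64, 4),               # Q-values for up, down, left, right
)
```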

## Training

The model was trained using a hybrid approach:
1. **A* Algorithm**: Initially, the A* algorithm was used to find the shortest path in a static environment.
2. **Reinforcement Learning**: The DQN was then trained with guidance from the A* path to improve efficiency and adaptability (one possible guidance scheme is sketched after this list).
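
The exact guidance mechanism is not documented in this card, so the sketch below shows one common option, reward shaping along the A* path. The grid encoding (0 = free, 1 = obstacle) and the `shaped_reward` helper are illustrative assumptions, not the repository's actual training code.

```python
import heapq

def astar(grid, start, goal):
    """Shortest 4-connected path from start to goal on an occupancy grid
    (0 = free, 1 = obstacle); returns a list of (row, col) cells or None."""
    h = lambda a, b: abs(a[0] - b[0]) + abs(a[1] - b[1])  # Manhattan heuristic
    frontier = [(h(start, goal), 0, start, [start])]      # (f, g, cell, path)
    visited = set()
    while frontier:
        _, g, cell, path = heapq.heappop(frontier)
        if cell == goal:
            return path
        if cell in visited:
            continue
        visited.add(cell)
        r, c = cell
        for nxt in ((r - 1, c), (r + 1, c), (r, c - 1), (r, c + 1)):
            if (0 <= nxt[0] < len(grid) and 0 <= nxt[1] < len(grid[0])
                    and grid[nxt[0]][nxt[1]] == 0 and nxt not in visited):
                heapq.heappush(frontier, (g + 1 + h(nxt, goal), g + 1, nxt, path + [nxt]))
    return None  # target unreachable

# Illustrative guidance: a small bonus whenever the agent steps onto the A* path
def shaped_reward(base_reward, next_cell, astar_path, bonus=0.1):
    return base_reward + (bonus if next_cell in astar_path else 0.0)
```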

### Hyperparameters
The following values were used during training (the sketch after this list shows where each enters a standard DQN update):
- Learning Rate: 0.001
- Batch Size: 64
- Gamma (Discount Factor): 0.99
- Target Update Frequency: Every 100 episodes
- Number of Episodes: 50
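
The following is a hypothetical sketch of where each value enters a standard DQN update; the replay buffer and training loop are not part of this repository, so `dqn_update` and the sampled minibatch are illustrative only.

```python
import torch
import torch.nn as nn
import torch.optim as optim

GAMMA = 0.99          # discount factor
BATCH_SIZE = 64       # minibatch size sampled from the replay buffer
LR = 0.001            # learning rate
TARGET_UPDATE = 100   # episodes between target-network syncs

def make_net():
    # Same shapes as the architecture above (flattened 10x10 grid -> 4 actions)
    return nn.Sequential(nn.Linear(100, 64), nn.ReLU(),
                         nn.Linear(64, 64), nn.ReLU(),
                         nn.Linear(64, 4))

policy_net, target_net = make_net(), make_net()
target_net.load_state_dict(policy_net.state_dict())
optimizer = optim.Adam(policy_net.parameters(), lr=LR)

def dqn_update(states, actions, rewards, next_states, dones):
    """One gradient step on a minibatch of BATCH_SIZE transitions."""
    # Q(s, a) for the actions actually taken
    q_values = policy_net(states).gather(1, actions.unsqueeze(1)).squeeze(1)
    # Bootstrapped target from the periodically synced target network
    with torch.no_grad():
        next_q = target_net(next_states).max(dim=1).values
        targets = rewards + GAMMA * next_q * (1 - dones)
    loss = nn.functional.mse_loss(q_values, targets)
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()

# Every TARGET_UPDATE episodes: target_net.load_state_dict(policy_net.state_dict())
```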

## Usage

To use this model, load the saved state dictionary and initialize the DQN with the same architecture. The model can then be used to navigate a hospital floorplan and find an efficient path to the target.

### Example Code

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

# Define the DQN class (same as in the training script)
class DQN(nn.Module):
    def __init__(self, input_size, hidden_sizes, output_size):
        super(DQN, self).__init__()
        self.input_size = input_size
        self.hidden_sizes = hidden_sizes
        self.output_size = output_size

        # Build the fully connected hidden layers
        self.fc_layers = nn.ModuleList()
        prev_size = input_size
        for size in hidden_sizes:
            self.fc_layers.append(nn.Linear(prev_size, size))
            prev_size = size
        self.output_layer = nn.Linear(prev_size, output_size)

    def forward(self, x):
        # Flatten any grid-shaped input to (batch, input_size)
        if len(x.shape) > 2:
            x = x.view(x.size(0), -1)
        for layer in self.fc_layers:
            x = F.relu(layer(x))
        x = self.output_layer(x)
        return x

# Load the model
input_size = 100  # 10x10 grid flattened
hidden_sizes = [64, 64]
output_size = 4
model = DQN(input_size, hidden_sizes, output_size)
model.load_state_dict(torch.load('dqn_model.pth'))
model.eval()

# Use the model for inference (example state)
state = ...  # Define your state here, e.g. a flattened 10x10 grid
with torch.no_grad():
    action = model(torch.tensor(state, dtype=torch.float32).unsqueeze(0)).argmax().item()
```
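
The returned `action` is an index into the four moves listed under Model Architecture; assuming they follow the order given there, 0 = up, 1 = down, 2 = left, and 3 = right, but verify the mapping against your own training setup.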

## Evaluation

The model was evaluated on the following metrics (a computation sketch follows the list):

- Average Reward: The mean reward over several evaluation episodes
- Success Rate: The proportion of episodes in which the agent successfully reached the target
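
The evaluation environment is not included here, so the sketch below only illustrates how such metrics are typically computed; `env`, `select_action`, and the `reached_target` flag are hypothetical stand-ins.

```python
def evaluate(env, select_action, num_episodes=100):
    """Hypothetical metric computation; env and select_action are stand-ins."""
    total_reward, successes = 0.0, 0
    for _ in range(num_episodes):
        state, done, episode_reward = env.reset(), False, 0.0
        while not done:
            state, reward, done, info = env.step(select_action(state))
            episode_reward += reward
        total_reward += episode_reward
        successes += int(info.get('reached_target', False))  # success flag is assumed
    # average reward, success rate
    return total_reward / num_episodes, successes / num_episodes
```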

## Initial Evaluation Results
- Average Reward: 8.84
- Success Rate: 1.0

## Limitations

- The model's performance can be influenced by the complexity of the floorplan and the density of obstacles.
- It requires a grid-based representation of the environment for accurate navigation.

## Acknowledgements

This project combines reinforcement learning with a traditional pathfinding algorithm (A*) to navigate complex environments efficiently.

## Citation

If you use this model in your research, please cite it as follows:
```bibtex
@misc{jones2024dqnhospital,
  author = {Christopher Jones},
  title = {Deep Q-Network for Floorplan Navigation},
  year = {2024},
  howpublished = {\url{https://huggingface.co/cajcodes/dqn-hospital-floorplan}},
  note = {Accessed: YYYY-MM-DD}
}
```