Spaces:
Sleeping
Sleeping
Upload folder using huggingface_hub
Browse files- README.md +17 -8
- requirements.txt +1 -1
README.md
CHANGED
|
@@ -4,7 +4,7 @@ emoji: 🧠
|
|
| 4 |
colorFrom: blue
|
| 5 |
colorTo: green
|
| 6 |
sdk: gradio
|
| 7 |
-
sdk_version: 4.
|
| 8 |
app_file: app.py
|
| 9 |
pinned: false
|
| 10 |
tags:
|
|
@@ -12,16 +12,25 @@ tags:
|
|
| 12 |
- reinforcement-learning
|
| 13 |
- planning
|
| 14 |
- ai-education
|
|
|
|
| 15 |
---
|
| 16 |
|
| 17 |
-
#
|
| 18 |
|
| 19 |
-
Interactive
|
| 20 |
|
| 21 |
## Features
|
| 22 |
-
- Visual grid environment with agent, goal, and obstacles
|
| 23 |
-
- Phase-based learning cycle (Observe → Predict → Act → Learn)
|
| 24 |
-
- Real-time model accuracy tracking
|
| 25 |
-
- Educational visualization of AI planning
|
| 26 |
|
| 27 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 4 |
colorFrom: blue
|
| 5 |
colorTo: green
|
| 6 |
sdk: gradio
|
| 7 |
+
sdk_version: 4.44.0
|
| 8 |
app_file: app.py
|
| 9 |
pinned: false
|
| 10 |
tags:
|
|
|
|
| 12 |
- reinforcement-learning
|
| 13 |
- planning
|
| 14 |
- ai-education
|
| 15 |
+
- cognitive-architecture
|
| 16 |
---
|
| 17 |
|
| 18 |
+
# World Model Demo
|
| 19 |
|
| 20 |
+
Interactive visualization of model-based reinforcement learning concepts, inspired by architectures like MuZero and Dreamer.
|
| 21 |
|
| 22 |
## Features
|
|
|
|
|
|
|
|
|
|
|
|
|
| 23 |
|
| 24 |
+
- **Mental Simulation**: Watch the AI plan ahead by imagining future states
|
| 25 |
+
- **Tree Search Visualization**: See how different action sequences are evaluated
|
| 26 |
+
- **Configurable Environment**: Adjust grid size and obstacles
|
| 27 |
+
- **Step-by-Step Mode**: Understand each phase of the planning process
|
| 28 |
+
|
| 29 |
+
## How It Works
|
| 30 |
+
|
| 31 |
+
1. **Observe**: Agent perceives current grid state
|
| 32 |
+
2. **Imagine**: World model predicts outcomes of possible actions
|
| 33 |
+
3. **Plan**: Tree search evaluates action sequences
|
| 34 |
+
4. **Act**: Best action is selected and executed
|
| 35 |
+
|
| 36 |
+
This demonstrates key concepts from model-based RL without requiring actual neural networks.
|
requirements.txt
CHANGED
|
@@ -1 +1 @@
|
|
| 1 |
-
|
|
|
|
| 1 |
+
# Dependencies managed by sdk_version in README.md
|