anthonym21 committed on
Commit c5931ad · verified · 1 Parent(s): 0ab3945

Upload folder using huggingface_hub

Files changed (2)
  1. README.md +17 -8
  2. requirements.txt +1 -1
README.md CHANGED
@@ -4,7 +4,7 @@ emoji: 🧠
  colorFrom: blue
  colorTo: green
  sdk: gradio
- sdk_version: 4.36.0
+ sdk_version: 4.44.0
  app_file: app.py
  pinned: false
  tags:
@@ -12,16 +12,25 @@ tags:
  - reinforcement-learning
  - planning
  - ai-education
+ - cognitive-architecture
  ---

- # 🧠 World Model Demo
+ # World Model Demo

- Interactive demonstration of world model concepts in AI planning.
+ Interactive visualization of model-based reinforcement learning concepts, inspired by architectures like MuZero and Dreamer.

  ## Features
- - Visual grid environment with agent, goal, and obstacles
- - Phase-based learning cycle (Observe → Predict → Act → Learn)
- - Real-time model accuracy tracking
- - Educational visualization of AI planning

- Built for exploring model-based reinforcement learning concepts.
+ - **Mental Simulation**: Watch the AI plan ahead by imagining future states
+ - **Tree Search Visualization**: See how different action sequences are evaluated
+ - **Configurable Environment**: Adjust grid size and obstacles
+ - **Step-by-Step Mode**: Understand each phase of the planning process
+
+ ## How It Works
+
+ 1. **Observe**: Agent perceives current grid state
+ 2. **Imagine**: World model predicts outcomes of possible actions
+ 3. **Plan**: Tree search evaluates action sequences
+ 4. **Act**: Best action is selected and executed
+
+ This demonstrates key concepts from model-based RL without requiring actual neural networks.
requirements.txt CHANGED
@@ -1 +1 @@
- gradio>=5.0.0
+ # Dependencies managed by sdk_version in README.md
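
The Observe, Imagine, Plan, Act loop listed in the README's "How It Works" section can be illustrated with a small sketch. This is not the code in `app.py`, which this commit does not show. It is a minimal example that assumes a square grid, a hand-coded deterministic world model in place of a learned one, and an exhaustive depth-limited search. All names here (`ACTIONS`, `imagine`, `plan`, `run_episode`) are hypothetical.

```python
from itertools import product

# Hypothetical action set for a 4-connected grid: name -> (d_row, d_col).
ACTIONS = {"up": (-1, 0), "down": (1, 0), "left": (0, -1), "right": (0, 1)}


def imagine(state, action, size, obstacles):
    """World model: predict the next cell for an action (hand-coded, not learned)."""
    d_row, d_col = ACTIONS[action]
    nxt = (state[0] + d_row, state[1] + d_col)
    in_bounds = 0 <= nxt[0] < size and 0 <= nxt[1] < size
    # Moves into a wall or an obstacle leave the agent where it was.
    return nxt if in_bounds and nxt not in obstacles else state


def distance(state, goal):
    """Manhattan distance between two cells."""
    return abs(state[0] - goal[0]) + abs(state[1] - goal[1])


def plan(state, goal, size, obstacles, depth=3):
    """Tree search: roll out every action sequence of length `depth` in the
    world model and return the first action of the best-scoring sequence."""
    best_value, best_action = float("-inf"), None
    for sequence in product(ACTIONS, repeat=depth):
        imagined, steps = state, 0
        for action in sequence:
            imagined = imagine(imagined, action, size, obstacles)
            steps += 1
            if imagined == goal:
                break  # stop imagining once the goal is reached
        # Fewer steps taken plus less remaining distance is better.
        value = -(steps + distance(imagined, goal))
        if value > best_value:
            best_value, best_action = value, sequence[0]
    return best_action


def run_episode(start, goal, size, obstacles, max_steps=50):
    """Observe -> Imagine -> Plan -> Act until the goal or the step limit."""
    state, path = start, [start]            # Observe the initial state
    for _ in range(max_steps):
        if state == goal:
            break
        action = plan(state, goal, size, obstacles)        # Imagine + Plan
        # Act: in this sketch the real environment behaves exactly like the model.
        state = imagine(state, action, size, obstacles)
        path.append(state)
    return path


if __name__ == "__main__":
    print(run_episode(start=(0, 0), goal=(3, 3), size=4, obstacles={(1, 1)}))
```

Counting steps in the score matters in this sketch: without it, a sequence that wastes a move against a wall and then reaches the goal would tie with a direct one, and the agent could stall. A fixed search depth with a distance heuristic can still get stuck behind larger obstacle layouts.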