jatinror commited on
Commit
cec6cd8
·
verified ·
1 Parent(s): e08a5e9

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +36 -47
README.md CHANGED
@@ -1,47 +1,36 @@
1
- ---
2
- license: apache-2.0
3
- tags:
4
- - reinforcement-learning
5
- - q-learning
6
- - taxi-agent
7
- - gym
8
- - python
9
- library_name: gym
10
- ---
11
-
12
- # Taxi-v3 Q-Learning Agent
13
-
14
- This repository contains a trained Q-learning agent for the Taxi-v3 environment from OpenAI Gym. The agent is stored as a `q_table.npy` file.
15
-
16
-
17
-
18
-
19
-
20
-
21
- # RL-Taxi Agent
22
-
23
- This project implements a **Q-learning agent** for the **OpenAI Gym Taxi-v3 environment**.
24
- The agent is trained to pick up and drop off passengers efficiently while maximizing the total reward.
25
-
26
- ---
27
-
28
- ## Features
29
-
30
- - **Environment:** OpenAI Gym `Taxi-v3`
31
- - **Algorithm:** Q-learning
32
- - **Visualization:**
33
- - Terminal ASCII render of the Taxi environment
34
- - Optional `pygame` GUI render
35
- - **Trained Model:** Q-table stored on [Hugging Face](https://huggingface.co/jatinror/taxi-q-table/resolve/main/q_table.npy)
36
- - **Python 3.10+** compatible
37
- - **Direct Hugging Face Integration:** Load Q-table without storing locally
38
-
39
- ---
40
-
41
- ## Installation
42
-
43
- 1. Clone the repository:
44
-
45
- ```bash
46
- git clone <your-repo-url>
47
- cd RL-Taxi
 
1
+ ---
2
+ license: apache-2.0
3
+ tags:
4
+ - reinforcement-learning
5
+ - q-learning
6
+ - taxi-agent
7
+ - gym
8
+ - python
9
+ library_name: gym
10
+ ---
11
+
12
+ # 🚖 Taxi-v3 Q-Learning Agent
13
+
14
+ This repository contains a trained **Q-learning agent** for the `Taxi-v3` environment.
15
+
16
+ The agent learns to efficiently pick up and drop off passengers while maximizing reward using a tabular Q-learning approach.
17
+
18
+ The trained model is packaged as `model.zip`, which contains the learned `q_table.npy`.
19
+
20
+ ---
21
+
22
+ ## 📌 Environment Details
23
+
24
+ - **Environment:** `Taxi-v3`
25
+ - **Algorithm:** Q-learning (Tabular RL)
26
+ - **State Space:** 500 discrete states
27
+ - **Action Space:** 6 discrete actions
28
+ - **Training Episodes:** 20,000+
29
+ - **Framework:** OpenAI Gym / Gymnasium compatible
30
+
31
+ ---
32
+
33
+ ## 📦 Model File
34
+
35
+ The trained agent is stored as:
36
+