FranticUser commited on
Commit
11d4c6f
·
verified ·
1 Parent(s): 58f1526

Upload folder using huggingface_hub

Browse files
Files changed (6) hide show
  1. .gitattributes +1 -0
  2. README.md +42 -0
  3. config.json +113 -0
  4. qtable.npy +3 -0
  5. replay.mp4 +3 -0
  6. results.json +1 -0
.gitattributes CHANGED
@@ -33,3 +33,4 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
33
  *.zip filter=lfs diff=lfs merge=lfs -text
34
  *.zst filter=lfs diff=lfs merge=lfs -text
35
  *tfevents* filter=lfs diff=lfs merge=lfs -text
 
 
33
  *.zip filter=lfs diff=lfs merge=lfs -text
34
  *.zst filter=lfs diff=lfs merge=lfs -text
35
  *tfevents* filter=lfs diff=lfs merge=lfs -text
36
+ replay.mp4 filter=lfs diff=lfs merge=lfs -text
README.md ADDED
@@ -0,0 +1,42 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ tags:
3
+ - Taxi-v3
4
+ - q-learning
5
+ - reinforcement-learning
6
+ - custom-implementation
7
+ model-index:
8
+ - name: Taxi-v3-trained
9
+ results:
10
+ - task:
11
+ type: reinforcement-learning
12
+ name: reinforcement-learning
13
+ dataset:
14
+ name: Taxi-v3
15
+ type: Taxi-v3
16
+ metrics:
17
+ - type: mean_reward
18
+ value: 7.56 +/- 2.71
19
+ name: mean_reward
20
+ verified: false
21
+ ---
22
+
23
+ **Q-Learning** Agent playing **Taxi-v3**
24
+
25
+ This is a trained **Q-Learning** agent for **Taxi-v3**.
26
+
27
+ ## Usage
28
+
29
+ ```python
30
+ import json
31
+ import numpy as np
32
+ import gym
33
+
34
+ qtable = np.load("qtable.npy")
35
+
36
+ with open("config.json") as f:
37
+ config = json.load(f)
38
+
39
+ env = gym.make(config["env_id"])
40
+ model = {**config, "qtable": qtable}
41
+ ```
42
+
config.json ADDED
@@ -0,0 +1,113 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "env_id": "Taxi-v3",
3
+ "max_steps": 99,
4
+ "n_training_episodes": 25000,
5
+ "n_eval_episodes": 100,
6
+ "eval_seed": [
7
+ 16,
8
+ 54,
9
+ 165,
10
+ 177,
11
+ 191,
12
+ 191,
13
+ 120,
14
+ 80,
15
+ 149,
16
+ 178,
17
+ 48,
18
+ 38,
19
+ 6,
20
+ 125,
21
+ 174,
22
+ 73,
23
+ 50,
24
+ 172,
25
+ 100,
26
+ 148,
27
+ 146,
28
+ 6,
29
+ 25,
30
+ 40,
31
+ 68,
32
+ 148,
33
+ 49,
34
+ 167,
35
+ 9,
36
+ 97,
37
+ 164,
38
+ 176,
39
+ 61,
40
+ 7,
41
+ 54,
42
+ 55,
43
+ 161,
44
+ 131,
45
+ 184,
46
+ 51,
47
+ 170,
48
+ 12,
49
+ 120,
50
+ 113,
51
+ 95,
52
+ 126,
53
+ 51,
54
+ 98,
55
+ 36,
56
+ 135,
57
+ 54,
58
+ 82,
59
+ 45,
60
+ 95,
61
+ 89,
62
+ 59,
63
+ 95,
64
+ 124,
65
+ 9,
66
+ 113,
67
+ 58,
68
+ 85,
69
+ 51,
70
+ 134,
71
+ 121,
72
+ 169,
73
+ 105,
74
+ 21,
75
+ 30,
76
+ 11,
77
+ 50,
78
+ 65,
79
+ 12,
80
+ 43,
81
+ 82,
82
+ 145,
83
+ 152,
84
+ 97,
85
+ 106,
86
+ 55,
87
+ 31,
88
+ 85,
89
+ 38,
90
+ 112,
91
+ 102,
92
+ 168,
93
+ 123,
94
+ 97,
95
+ 21,
96
+ 83,
97
+ 158,
98
+ 26,
99
+ 80,
100
+ 63,
101
+ 5,
102
+ 81,
103
+ 32,
104
+ 11,
105
+ 28,
106
+ 148
107
+ ],
108
+ "learning_rate": 0.75,
109
+ "gamma": 0.95,
110
+ "max_epsilon": 1.0,
111
+ "min_epsilon": 0.05,
112
+ "decay_rate": 0.005
113
+ }
qtable.npy ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:2a422a58b26cb167cea958130f4bc8329b579ea56010a8759c30f956f7055114
3
+ size 24128
replay.mp4 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:1480dde05748bb59481baf211c07811f1659649df71c1fd5bddf950cd168637a
3
+ size 128598
results.json ADDED
@@ -0,0 +1 @@
 
 
1
+ {"env_id": "Taxi-v3", "mean_reward": 7.56, "n_eval_episodes": 100, "eval_datetime": "2026-01-28T04:03:11.455336"}