seynath commited on
Commit
ce0ccf5
·
verified ·
1 Parent(s): 7eaf640

Upload folder using huggingface_hub

Browse files
Files changed (8) hide show
  1. .gitattributes +1 -0
  2. PPO-Taxi-v3.zip +3 -0
  3. README.md +60 -0
  4. algorithm.zip +3 -0
  5. replay.mp4 +3 -0
  6. results.json +1 -0
  7. system.json +1 -0
  8. training_metrics.json +0 -0
.gitattributes CHANGED
@@ -33,3 +33,4 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
33
  *.zip filter=lfs diff=lfs merge=lfs -text
34
  *.zst filter=lfs diff=lfs merge=lfs -text
35
  *tfevents* filter=lfs diff=lfs merge=lfs -text
 
 
33
  *.zip filter=lfs diff=lfs merge=lfs -text
34
  *.zst filter=lfs diff=lfs merge=lfs -text
35
  *tfevents* filter=lfs diff=lfs merge=lfs -text
36
+ replay.mp4 filter=lfs diff=lfs merge=lfs -text
PPO-Taxi-v3.zip ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:328b262c8bf955dd38b5c7e468e10fd6fd59ec8326d986ff069d9d6c14394df6
3
+ size 906213
README.md ADDED
@@ -0,0 +1,60 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ tags:
3
+ - Taxi-v3
4
+ - reinforcement-learning
5
+ - rl-framework
6
+ model-index:
7
+ - name: PPO-Taxi-v3
8
+ results:
9
+ - task:
10
+ type: reinforcement-learning
11
+ name: reinforcement-learning
12
+ dataset:
13
+ name: Taxi-v3
14
+ type: Taxi-v3
15
+ metrics:
16
+ - type: mean_reward
17
+ value: 7.72 +/- 2.66
18
+ name: mean_reward
19
+ verified: false
20
+ ---
21
+
22
+
23
+ # PPO agent playing on *Taxi-v3*
24
+
25
+ This is a trained model of an agent playing on the environment *Taxi-v3*.
26
+ The agent was trained with a PPO algorithm and evaluated for 100 episodes.
27
+ See further agent and evaluation metadata in the according README section.
28
+
29
+
30
+ ## Import
31
+ The Python module used for training and uploading/downloading is [rl-framework](https://github.com/alexander-zap/rl-framework).
32
+ It is an easy-to-read, plug-and-use Reinforcement Learning framework and provides standardized interfaces
33
+ and implementations to various Reinforcement Learning methods and environments.
34
+
35
+ Also it provides connectors for the upload and download to popular model version control systems,
36
+ including the HuggingFace Hub.
37
+
38
+ ## Usage
39
+ ```python
40
+
41
+ from rl_framework import StableBaselinesAgent, StableBaselinesAlgorithm
42
+
43
+ # Create new agent instance
44
+ agent = StableBaselinesAgent(
45
+ algorithm=StableBaselinesAlgorithm.PPO
46
+ algorithm_parameters={
47
+ ...
48
+ },
49
+ )
50
+
51
+ # Download existing agent from HF Hub
52
+ repository_id = "zap-thamm/PPO-Taxi-v3"
53
+ file_name = "algorithm.zip"
54
+ agent.download(repository_id=repository_id, filename=file_name)
55
+
56
+ ```
57
+
58
+ Further examples can be found in the [exploration section of the rl-framework repository](https://github.com/alexander-zap/rl-framework/tree/main/exploration).
59
+
60
+
algorithm.zip ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:83ede117870723f677867829415ce99e86492a34da2c0648e0020fc6fc631a07
3
+ size 905487
replay.mp4 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:413589af02714640ab8b6e69369a9a889cb2e622fbc0d11b94527fcb4129566b
3
+ size 778530
results.json ADDED
@@ -0,0 +1 @@
 
 
1
+ {"env_id": "Taxi-v3", "mean_reward": 7.72, "n_eval_episodes": 100, "eval_datetime": "2024-01-12T14:21:28.530891"}
system.json ADDED
@@ -0,0 +1 @@
 
 
1
+ {"OS": "Windows-10-10.0.19045-SP0 10.0.19045", "Python": "3.10.8", "Stable-Baselines3": "2.2.1", "PyTorch": "2.1.2+cpu", "GPU Enabled": "False", "Numpy": "1.26.3", "Cloudpickle": "3.0.0", "Gymnasium": "0.29.1"}
training_metrics.json ADDED
The diff for this file is too large to render. See raw diff