Upload folder using huggingface_hub

Files changed (8)

.gitattributes +1 -0
PPO-Taxi-v3.zip +3 -0
README.md +60 -0
algorithm.zip +3 -0
replay.mp4 +3 -0
results.json +1 -0
system.json +1 -0
training_metrics.json +0 -0

.gitattributes CHANGED

@@ -33,3 +33,4 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
 *.zip filter=lfs diff=lfs merge=lfs -text
 *.zst filter=lfs diff=lfs merge=lfs -text
 *tfevents* filter=lfs diff=lfs merge=lfs -text
+replay.mp4 filter=lfs diff=lfs merge=lfs -text

PPO-Taxi-v3.zip ADDED

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:328b262c8bf955dd38b5c7e468e10fd6fd59ec8326d986ff069d9d6c14394df6
+size 906213
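The three added lines above are a Git LFS pointer rather than the archive itself: they record the pointer spec version, the SHA-256 digest of the real file, and its size in bytes. As a minimal sketch, such a pointer could be read with the Python standard library as follows. The helper name `parse_lfs_pointer` is hypothetical and not part of any library.

```python
def parse_lfs_pointer(text: str) -> dict:
    """Parse the key/value lines of a Git LFS pointer file (hypothetical helper)."""
    fields = {}
    for line in text.strip().splitlines():
        # Each pointer line has the form "<key> <value>".
        key, _, value = line.partition(" ")
        fields[key] = value
    # The oid value has the form "<hash algorithm>:<hex digest>".
    algorithm, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "hash_algorithm": algorithm,
        "digest": digest,
        "size_bytes": int(fields["size"]),
    }


# Pointer content copied from the PPO-Taxi-v3.zip diff above.
pointer = """version https://git-lfs.github.com/spec/v1
oid sha256:328b262c8bf955dd38b5c7e468e10fd6fd59ec8326d986ff069d9d6c14394df6
size 906213
"""

info = parse_lfs_pointer(pointer)
print(info["hash_algorithm"], info["size_bytes"])
```

The same function applies unchanged to the `algorithm.zip` and `replay.mp4` pointers further down, since all three follow the same three-line layout.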

README.md ADDED

	@@ -0,0 +1,60 @@

+---
+tags:
+- Taxi-v3
+- reinforcement-learning
+- rl-framework
+model-index:
+- name: PPO-Taxi-v3
+  results:
+  - task:
+      type: reinforcement-learning
+      name: reinforcement-learning
+    dataset:
+      name: Taxi-v3
+      type: Taxi-v3
+    metrics:
+    - type: mean_reward
+      value: 7.72 +/- 2.66
+      name: mean_reward
+      verified: false
+---
+# PPO agent playing on *Taxi-v3*
+This is a trained model of an agent playing on the environment *Taxi-v3*.
+The agent was trained with a PPO algorithm and evaluated for 100 episodes.
+Further agent and evaluation metadata can be found in the corresponding README section.
+## Import
+The Python module used for training and uploading/downloading is [rl-framework](https://github.com/alexander-zap/rl-framework).
+It is an easy-to-read, plug-and-use Reinforcement Learning framework that provides standardized interfaces
+and implementations for various Reinforcement Learning methods and environments.
+It also provides connectors for uploading to and downloading from popular model version control systems,
+including the HuggingFace Hub.
+## Usage
+```python
+from rl_framework import StableBaselinesAgent, StableBaselinesAlgorithm
+# Create new agent instance
+agent = StableBaselinesAgent(
+    algorithm=StableBaselinesAlgorithm.PPO,
+    algorithm_parameters={
+        ...
+    },
+)
+# Download existing agent from HF Hub
+repository_id = "zap-thamm/PPO-Taxi-v3"
+file_name = "algorithm.zip"
+agent.download(repository_id=repository_id, filename=file_name)
+```
+Further examples can be found in the [exploration section of the rl-framework repository](https://github.com/alexander-zap/rl-framework/tree/main/exploration).

algorithm.zip ADDED

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:83ede117870723f677867829415ce99e86492a34da2c0648e0020fc6fc631a07
+size 905487

replay.mp4 ADDED

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:413589af02714640ab8b6e69369a9a889cb2e622fbc0d11b94527fcb4129566b
+size 778530

results.json ADDED

	@@ -0,0 +1 @@


+{"env_id": "Taxi-v3", "mean_reward": 7.72, "n_eval_episodes": 100, "eval_datetime": "2024-01-12T14:21:28.530891"}
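The evaluation record above is plain JSON, so it can be inspected with the standard library alone. The sketch below embeds the record as a string so that it runs on its own; loading it from a downloaded copy of `results.json` would work the same way. The variable names are illustrative only.

```python
import json
from datetime import datetime

# Record copied from the results.json diff above.
raw = (
    '{"env_id": "Taxi-v3", "mean_reward": 7.72, "n_eval_episodes": 100, '
    '"eval_datetime": "2024-01-12T14:21:28.530891"}'
)

results = json.loads(raw)

# The timestamp is ISO 8601 without a timezone offset, so it parses as a naive datetime.
evaluated_at = datetime.fromisoformat(results["eval_datetime"])

summary = (
    f'{results["env_id"]}: mean reward {results["mean_reward"]} '
    f'over {results["n_eval_episodes"]} episodes'
)
print(summary)
print(evaluated_at.date())
```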

system.json ADDED

	@@ -0,0 +1 @@


+{"OS": "Windows-10-10.0.19045-SP0 10.0.19045", "Python": "3.10.8", "Stable-Baselines3": "2.2.1", "PyTorch": "2.1.2+cpu", "GPU Enabled": "False", "Numpy": "1.26.3", "Cloudpickle": "3.0.0", "Gymnasium": "0.29.1"}

training_metrics.json ADDED

The diff for this file is too large to render.