Kallinteris-Andreas committed
Commit 34849cc · verified
1 Parent(s): cf184ad
README.md CHANGED
@@ -16,16 +16,19 @@ model-index:
  type: Walker2d-v5
  metrics:
  - type: mean_reward
- value: 6154.44 +/- 787.84
+ value: 6173.86 +/- 197.86
  name: mean_reward
  verified: false
  ---
 
  # **SAC** Agent playing **Walker2d-v5**
+
  This is a trained model of a **SAC** agent playing **Walker2d-v5**
  using the [stable-baselines3 library](https://github.com/DLR-RM/stable-baselines3).
 
+
  ## Usage (with Stable-baselines3)
+
  TODO: Add your code
 
 
@@ -34,4 +37,4 @@ from stable_baselines3 import ...
  from huggingface_sb3 import load_from_hub
 
  ...
- ```
+ ```
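Note on reading the `config.json` diff in this commit: several fields (`policy_class`, `replay_buffer_class`, `lr_schedule`, and others) carry a `:serialized:` value. In Stable-Baselines3 these are base64-encoded pickle payloads. The sketch below shows one way to inspect such a value without unpickling it, using only the standard library. The helper name `pickled_strings` is hypothetical and not part of any library. The input string is the `policy_class` value copied from the diff.

```python
import base64
import pickletools

# Copied from the "policy_class" -> ":serialized:" field of config.json.
SERIALIZED_POLICY_CLASS = (
    "gAWVMAAAAAAAAACMHnN0YWJsZV9iYXNlbGluZXMzLnNhYy5wb2xpY2llc5SMCVNBQ1BvbGljeZSTlC4="
)

# Pickle opcodes whose argument is a unicode string literal.
_STRING_OPCODES = {"SHORT_BINUNICODE", "BINUNICODE", "BINUNICODE8"}


def pickled_strings(serialized: str) -> list[str]:
    """Return the string literals embedded in a base64-encoded pickle.

    Hypothetical helper for illustration: it walks the opcode stream with
    pickletools and never calls pickle.loads, so nothing referenced by the
    payload is imported or executed.
    """
    payload = base64.b64decode(serialized)
    return [
        arg
        for opcode, arg, _pos in pickletools.genops(payload)
        if opcode.name in _STRING_OPCODES
    ]


# For a pickled class reference, the strings are the module and class name.
module_name, class_name = pickled_strings(SERIALIZED_POLICY_CLASS)
print(f"{module_name}.{class_name}")
```

For this payload the printed path should match the `__module__` entry stored next to it (`stable_baselines3.sac.policies`). Walking opcodes rather than calling `pickle.loads` is a deliberate choice here: it avoids executing anything from a downloaded file and does not require Stable-Baselines3 to be installed.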
config.json CHANGED
@@ -1 +1 @@
- {"policy_class": {":type:": "<class 'abc.ABCMeta'>", ":serialized:": "gAWVMAAAAAAAAACMHnN0YWJsZV9iYXNlbGluZXMzLnNhYy5wb2xpY2llc5SMCVNBQ1BvbGljeZSTlC4=", "__module__": "stable_baselines3.sac.policies", "__annotations__": "{'actor': <class 'stable_baselines3.sac.policies.Actor'>, 'critic': <class 'stable_baselines3.common.policies.ContinuousCritic'>, 'critic_target': <class 'stable_baselines3.common.policies.ContinuousCritic'>}", "__doc__": "\n Policy class (with both actor and critic) for SAC.\n\n :param observation_space: Observation space\n :param action_space: Action space\n :param lr_schedule: Learning rate schedule (could be constant)\n :param net_arch: The specification of the policy and value networks.\n :param activation_fn: Activation function\n :param use_sde: Whether to use State Dependent Exploration or not\n :param log_std_init: Initial value for the log standard deviation\n :param use_expln: Use ``expln()`` function instead of ``exp()`` when using gSDE to ensure\n a positive standard deviation (cf paper). It allows to keep variance\n above zero and prevent it from growing too fast. 
In practice, ``exp()`` is usually enough.\n :param clip_mean: Clip the mean output when using gSDE to avoid numerical instability.\n :param features_extractor_class: Features extractor to use.\n :param features_extractor_kwargs: Keyword arguments\n to pass to the features extractor.\n :param normalize_images: Whether to normalize images or not,\n dividing by 255.0 (True by default)\n :param optimizer_class: The optimizer to use,\n ``th.optim.Adam`` by default\n :param optimizer_kwargs: Additional keyword arguments,\n excluding the learning rate, to pass to the optimizer\n :param n_critics: Number of critic networks to create.\n :param share_features_extractor: Whether to share or not the features extractor\n between the actor and the critic (this saves computation time)\n ", "__init__": "<function SACPolicy.__init__ at 0x7fe97d6284a0>", "_build": "<function SACPolicy._build at 0x7fe97d628ae0>", "_get_constructor_parameters": "<function SACPolicy._get_constructor_parameters at 0x7fe97d628b80>", "reset_noise": "<function SACPolicy.reset_noise at 0x7fe97d628c20>", "make_actor": "<function SACPolicy.make_actor at 0x7fe97d628cc0>", "make_critic": "<function SACPolicy.make_critic at 0x7fe97d628d60>", "forward": "<function SACPolicy.forward at 0x7fe97d628e00>", "_predict": "<function SACPolicy._predict at 0x7fe97d628ea0>", "set_training_mode": "<function SACPolicy.set_training_mode at 0x7fe97d628f40>", "__abstractmethods__": "frozenset()", "_abc_impl": "<_abc._abc_data object at 0x7fe97d62cf40>"}, "verbose": 0, "policy_kwargs": {"use_sde": false}, "num_timesteps": 20000000, "_total_timesteps": 20000000, "_num_timesteps_at_start": 0, "seed": 0, "action_noise": null, "start_time": 1731416626161730236, "learning_rate": 0.0003, "tensorboard_log": "runs/n58k3j87", "_last_obs": {":type:": "<class 'numpy.ndarray'>", ":serialized:": 
"gAWVHQMAAAAAAACMEm51bXB5LmNvcmUubnVtZXJpY5SMC19mcm9tYnVmZmVylJOUKJaoAgAAAAAAAEeyxT4GA/M/NRwZm6y2ur9eYlQJ8Vnnv0Di5xmi+3Y/WC7ZiDTJ1j+VeApA9hafP6DTPxBbhNi/By/tjGGB4D+UlRYolp0UQExzJruLwNy//yZBw7IeEMDm//wz0tjev2ThGDwhENA/ruo641WVFEAxc/93wEy6PwAAAAAAACTAAAAAAAAAJMDe7vBC6n3zP5RULBYHHZ8/I38qkVB2mz92AhDEXNzbv3ufKsRvwaM/gr9J4/2O1r8MwOfUX5zgv4J98Pclr3c/c36TIJc1FkC46u681fW7P7pEoJMV1uC/jbJYx1jM1b8AAAAAAAAkwF5hqRM1rRPAvKJ45gg7+L/vYb44CvgbQChMoRqm+RFAVeS/2YK88j994SKNsTrav7VcKlFcqqM/tJ70xWTdiT8UDmtdO6XpP4ZV0PWqfvC/Q7wB22qpsD8TdGOlTVbVPz9B/LFGoxZA/t6ySbS58r8BcSeE+18DwIOkrDSNfNG/u96b5tgbuj9LrKL5LFS4P+TjXHTIdQfAtJWors9w+j+l0cpfNIMfQA9vzRVecvM/q33E1SpV1r+WUPU5bemgP2ANHd+fD5Q/RpWMywqf6T/UcJHKuQTgv80sdcYdeee/rMi7vW1t7b+I+2gHOvMVQEllwtraodA/saVGlhrL4D93PDmk6DvSP8pEYZImpao/hzJoUlIk4T/+VGS8/cwOQHBrJEfD1SHAcEbKJwuMG8CviDie2Df1P8SGQXV0Mbk/xC10Dc+2079rRmgnHhPRv+fg0qtV7+2/N1ixrGtKpz+GpBhPpU2kP1jKigpOL+U/UOaYWG1rGUCKjxMVsUDhPzun3fKr2AbAXVLxchUoIcAu66kZC64UQD730rdakAxAylVxCq1c1782bGmMY9LpPwfVRKp6GRZAlIwFbnVtcHmUjAVkdHlwZZSTlIwCZjiUiYiHlFKUKEsDjAE8lE5OTkr/////Sv////9LAHSUYksFSxGGlIwBQ5R0lFKULg=="}, "_last_episode_starts": {":type:": "<class 'numpy.ndarray'>", ":serialized:": "gAWVeAAAAAAAAACMEm51bXB5LmNvcmUubnVtZXJpY5SMC19mcm9tYnVmZmVylJOUKJYFAAAAAAAAAAEBAQEBlIwFbnVtcHmUjAVkdHlwZZSTlIwCYjGUiYiHlFKUKEsDjAF8lE5OTkr/////Sv////9LAHSUYksFhZSMAUOUdJRSlC4="}, "_last_original_obs": {":type:": "<class 'numpy.ndarray'>", ":serialized:": 
"gAWVHQMAAAAAAACMEm51bXB5LmNvcmUubnVtZXJpY5SMC19mcm9tYnVmZmVylJOUKJaoAgAAAAAAAPoG88zHEfM/T3c65f4Ps7+W5Gwetzznv2l9yH/4J2Q/9H49ubxO1D/V4e524MCfP4Dvyu4nNNC/KjI+5zo45D9DPuwI95IVQPNUZYzJHt2/ZU2KQ9MADMDwxDUnGjbfv44s+6RPNOI/Vxz1pqsFEkB1eXDcRCHUvwAAAAAAACTAOrquZVscIMBKOzbs1HnzP8n0OhID8KA/D5qq+xxenj/EXcW6CU7Wv7Xvf7wzpbk/S2o7HMul1b/JTMnit4viv4zSC8NrF6G/M0/eE8ATF0B1XJ4fXuPBPwj/Ix+JzsS/GJ35RrsS178AAAAAAAAkwAAAAAAAACTAk5pT2RNrAMDAmryYcjMgQMBaLcGbUxVA/zFJ32Xj8j9/rnIIkerYv9cukYZoBKU/qw9bwk2BiD+L4dqsK6DpP16YYk4t+++/Br4CmvQppD8gJpa7bVfRP3o5tzdsrhZAeRUxb2RM879SRvC1WLAFwKMZHOfOY9m/I+B9SaMorT+H4QElp4OrP7e+010SiBTAtZ1f9Z2ZFEDI6cB+pnYeQK+uvZnhZ/M/0a5suj2Q1r+9rxnWIHGfP03mladCLJM/OUrjaEZ86T8K4i3GhyXhv5C50R/BEeW/lXhLuRdt6r9oQUErP6sVQGacPC09Q9s/wFa3c99p2T9+r0PPGyzRP0PH8KxyicY/Umn379kK4D/I7Qe/lYsTQN8wj0jhBSLAAAAAAAAAJMABWdgEZST1P6En37uLOb4/C7NTFyGJzr+EytuzdCfUv3aqo5Ou0u6/Sl0YFsq+qD/Ij7YgxHaaPxjn8Axf8uM/E5zRtnt3GkC9+NJJD8XkP7WnMiVudP+/3KCoRC5iIcCz8q+R2XQbQPkBBRWh1glA1MReJAMd1r+OHZiaXfEGQM7g6yWVWhBAlIwFbnVtcHmUjAVkdHlwZZSTlIwCZjiUiYiHlFKUKEsDjAE8lE5OTkr/////Sv////9LAHSUYksFSxGGlIwBQ5R0lFKULg=="}, "_episode_num": 23437, "use_sde": false, "sde_sample_freq": -1, "_current_progress_remaining": 0.0, "_stats_window_size": 100, "ep_info_buffer": {":type:": "<class 'collections.deque'>", ":serialized:": 
"gAWVRAwAAAAAAACMC2NvbGxlY3Rpb25zlIwFZGVxdWWUk5QpS2SGlFKUKH2UKIwBcpRHQLnSk1He7+WMAWyUTegDjAF0lEdA672m98Z1m3V9lChoBkdAuN3+ZfD1oWgHTegDaAhHQOu+Npqh11Z1fZQoaAZHQLpN8nXd0q9oB03oA2gIR0DrvlrAbhm5dX2UKGgGR0C6Vmzjm0VraAdN6ANoCEdA6750dd3Sr3V9lChoBkdAujaCM4tHx2gHTegDaAhHQOu+7MfvF3p1fZQoaAZHQLk6t6ySmqJoB03oA2gIR0Drv3V9zfaYdX2UKGgGR0C6SpENayKOaAdN6ANoCEdA68AX+ueSS3V9lChoBkdAuk+5tBOYY2gHTegDaAhHQOvAPMtbs4V1fZQoaAZHQLqITwqy4WloB03oA2gIR0DrwFVCSA6NdX2UKGgGR0C6ANH84xUOaAdN6ANoCEdA68DWF05lv3V9lChoBkdAujMJJ+UhV2gHTegDaAhHQOvBYmoFV1h1fZQoaAZHQLnlX/ATIvJoB03oA2gIR0DrwfqzRhMKdX2UKGgGR0C6lo1NcnmaaAdN6ANoCEdA68IfM+mm+HV9lChoBkdAue2awMYuTWgHTegDaAhHQOvCOAFPi1l1fZQoaAZHQLmJJT9KmKtoB03oA2gIR0DrwroqXF98dX2UKGgGR0C3rHdIGyHEaAdN6ANoCEdA68NFKur6tXV9lChoBkdAugLXaBZpz2gHTegDaAhHQOvD2P+qBEt1fZQoaAZHQLpIhYPoV21oB03oA2gIR0Drw/uH+IdmdX2UKGgGR0C5pqiUC7sfaAdN6ANoCEdA68QSkSVW0nV9lChoBkdAutyMuPFNtmgHTegDaAhHQOvEks6HTJB1fZQoaAZHQLnUEi7CiypoB03oA2gIR0DrxSqJWvKVdX2UKGgGR0C540b5AQg+aAdN6ANoCEdA68Xb60QbuXV9lChoBkdAuq+UyuZCwGgHTegDaAhHQOvF+lQ/HHZ1fZQoaAZHQLp7RMewLVpoB03oA2gIR0DrxhALMLWqdX2UKGgGR0C5/6zyBkI5aAdN6ANoCEdA68aPmhdt23V9lChoBkdAui1YF9roGWgHTegDaAhHQOvHLiKekHl1fZQoaAZHQLqRYyoXKr9oB03oA2gIR0Drx+OGM4tIdX2UKGgGR0C6j3TTfBN3aAdN6ANoCEdA68gShLwnY3V9lChoBkdAuiyRglWwNmgHTegDaAhHQOvILxu4wyt1fZQoaAZHQLn+O3Zf2K5oB03oA2gIR0DryMBXQMQVdX2UKGgGR0C6g33UUfxMaAdN6ANoCEdA68ljWll9SnV9lChoBkdAujDAGlhw2mgHTegDaAhHQOvKErVH4Gl1fZQoaAZHQLnXMW2w3YNoB03oA2gIR0Dryj2BjnV5dX2UKGgGR0C6tchQWN3oaAdN6ANoCEdA68paLmp2lnV9lChoBkdAuB9cm6XjVGgHTbYDaAhHQOvKzhHEuQJ1fZQoaAZHQLnraAo5PuZoB03oA2gIR0Dry5IIRAbAdX2UKGgGR0C6WpTCDVYqaAdN6ANoCEdA68w/aI3zc3V9lChoBkdAunWc5OrQxGgHTegDaAhHQOvMZ1KTSst1fZQoaAZHQLp+tzl90A9oB03oA2gIR0DrzH/SApazdX2UKGgGR0C6Y6H1SOzZaAdN6ANoCEdA68z+StmthnV9lChoBkdAuRcswvg3tWgHTegDaAhHQOvNsee18b91fZQoaAZHQLozcgPEsJ9oB03oA2gIR0DrzlsPuogndX2UKGgGR0C6Rd6aLGaQaAdN6ANoCEdA686CtPpIMHV9lChoBkdAurZiJrLyMGgHTegDaAhHQOvOoGj7AL11fZQoaAZHQLqG0jynUDxoB03oA2gIR0DrzxmX0Gu+dX2UKGgGR0C6ahvVmSQpaAdN6ANoCEdA68/fS0rsjXV9lChoBkdAulYwD1XeWWgHTegDaAhHQOv
QhnXqZ+h1fZQoaAZHQLoXoulXRw9oB03oA2gIR0Dr0LNLdvbXdX2UKGgGR0C6EWB1oxpMaAdN6ANoCEdA69DYaPCEYnV9lChoBkdAuth5J17pmmgHTegDaAhHQOvRQ1X3g1p1fZQoaAZHQLrU9Rx95QhoB03oA2gIR0Dr0eoxQBPsdX2UKGgGR0C6gxnOB19waAdN6ANoCEdA69Klq0D2anV9lChoBkdAuqbX52yLRGgHTegDaAhHQOvS0Bx5s0p1fZQoaAZHQLrRrG4ZuQ9oB03oA2gIR0Dr0vDH2AXmdX2UKGgGR0C6fKkfgaWHaAdN6ANoCEdA69NuxQJokHV9lChoBkdAuln8puuRtGgHTegDaAhHQOvUFPJiiIt1fZQoaAZHQLka2UEPlMhoB03oA2gIR0Dr1MF19v0idX2UKGgGR0Cxgsxs/IKdaAdNzAJoCEdA69ThVFH8THV9lChoBkdAugl9zIV/MGgHTegDaAhHQOvU7P+bVjJ1fZQoaAZHQLmL3EJ0GNdoB03oA2gIR0Dr1QmZhKDkdX2UKGgGR0C5+HvCdjG2aAdN6ANoCEdA69YXj2alUXV9lChoBkdAuEN7DBMzuWgHTegDaAhHQOvWtQJokAx1fZQoaAZHQLrz6A/9pAVoB03oA2gIR0Dr1s1wazeGdX2UKGgGR0C6MWG+K0laaAdN6ANoCEdA69bVUmlZYHV9lChoBkdAuopkGdI5HWgHTegDaAhHQOvW7GvIOpd1fZQoaAZHQLrR7YfW+XZoB03oA2gIR0Dr1/cFaB7NdX2UKGgGR0C6MWloYekpaAdN6ANoCEdA69ie6Ogg5nV9lChoBkdAuq3Glj3Eh2gHTegDaAhHQOvYwbPdEb51fZQoaAZHQLo+DvwEyL1oB03oA2gIR0Dr2MwgyM1kdX2UKGgGR0C63fV1GLDRaAdN6ANoCEdA69joMk6cRXV9lChoBkdAurUhM36yjmgHTegDaAhHQOvaDRWaMJh1fZQoaAZHQLpHEs6JZW9oB03oA2gIR0Dr2qDkLhJidX2UKGgGR0C6lNPHT7VKaAdN6ANoCEdA69q8pX6qKnV9lChoBkdAuqyI31jAi2gHTegDaAhHQOvax9ktmL91fZQoaAZHQLpB2eK8+RpoB03oA2gIR0Dr2uJeTmnwdX2UKGgGR0C6PRyx/ustaAdN6ANoCEdA69wQtITXa3V9lChoBkdAumrmQPqcE2gHTegDaAhHQOvclL9n9Nx1fZQoaAZHQLnibEPUaydoB03oA2gIR0Dr3K2DM/yHdX2UKGgGR0C6GpMZccENaAdN6ANoCEdA69y2s10knnV9lChoBkdAudp2wQlKLGgHTegDaAhHQOvc0BSYPXl1fZQoaAZHQLqZwnKGL1poB03oA2gIR0Dr3e8E8q4IdX2UKGgGR0C6lrZ/wy6+aAdN6ANoCEdA696N12iconV9lChoBkdAunA1BkZrHmgHTegDaAhHQOveq05dWyV1fZQoaAZHQLqgmOvMbFVoB03oA2gIR0Dr3rRfTkQxdX2UKGgGR0C6SUwA2hqTaAdN6ANoCEdA697P3Wvr4XV9lChoBkdAudvXjjrAxmgHTegDaAhHQOvf8leF+NN1fZQoaAZHQLmAqtnwob5oB03oA2gIR0Dr4KFIkqtpdX2UKGgGR0C5tC6e5Fw2aAdN6ANoCEdA6+C9vYWcjXV9lChoBkdAucOW2w3YMGgHTegDaAhHQOvgx5X0Xgt1fZQoaAZHQLqvby7wrlNoB03oA2gIR0Dr4OCowVTKdX2UKGgGR0C6Dv4iLVFyaAdN6ANoCEdA6+H6tNzr/3V9lChoBkdAuspNl/Yra2gHTegDaAhHQOvim9CVryl1fZQoaAZHQLpOY5EMLF5oB03oA2gIR0Dr4rd752yLdX2UKGgGR0C6vVwMUh3aaAdN6ANoCEdA6+LBXbM5fnV9lChoBkdAumkKhCdBjWgHTegDaAhHQOvi3IbyYol1fZQoaAZ
HQLpMkA31jAloB03oA2gIR0Dr4/O51eSkdX2UKGgGR0C6X67g0j1PaAdN6ANoCEdA6+SWoJqqO3V9lChoBkdAujLC9/SYxGgHTegDaAhHQOvksBGOMl11fZQoaAZHQLn2sG96C19oB03oA2gIR0Dr5LlBfKISdX2UKGgGR0C5+9PYJ3PiaAdN6ANoCEdA6+TQg1vVE3VlLg=="}, "ep_success_buffer": {":type:": "<class 'collections.deque'>", ":serialized:": "gAWVIAAAAAAAAACMC2NvbGxlY3Rpb25zlIwFZGVxdWWUk5QpS2SGlFKULg=="}, "_n_updates": 3998000, "buffer_size": 1000000, "batch_size": 256, "learning_starts": 10000, "tau": 0.005, "gamma": 0.99, "gradient_steps": 1, "optimize_memory_usage": false, "replay_buffer_class": {":type:": "<class 'abc.ABCMeta'>", ":serialized:": "gAWVNQAAAAAAAACMIHN0YWJsZV9iYXNlbGluZXMzLmNvbW1vbi5idWZmZXJzlIwMUmVwbGF5QnVmZmVylJOULg==", "__module__": "stable_baselines3.common.buffers", "__annotations__": "{'observations': <class 'numpy.ndarray'>, 'next_observations': <class 'numpy.ndarray'>, 'actions': <class 'numpy.ndarray'>, 'rewards': <class 'numpy.ndarray'>, 'dones': <class 'numpy.ndarray'>, 'timeouts': <class 'numpy.ndarray'>}", "__doc__": "\n Replay buffer used in off-policy algorithms like SAC/TD3.\n\n :param buffer_size: Max number of element in the buffer\n :param observation_space: Observation space\n :param action_space: Action space\n :param device: PyTorch device\n :param n_envs: Number of parallel environments\n :param optimize_memory_usage: Enable a memory efficient variant\n of the replay buffer which reduces by almost a factor two the memory used,\n at a cost of more complexity.\n See https://github.com/DLR-RM/stable-baselines3/issues/37#issuecomment-637501195\n and https://github.com/DLR-RM/stable-baselines3/pull/28#issuecomment-637559274\n Cannot be used in combination with handle_timeout_termination.\n :param handle_timeout_termination: Handle timeout termination (due to timelimit)\n separately and treat the task as infinite horizon task.\n https://github.com/DLR-RM/stable-baselines3/issues/284\n ", "__init__": "<function ReplayBuffer.__init__ at 0x7fe98c274cc0>", "add": "<function 
ReplayBuffer.add at 0x7fe98c274e00>", "sample": "<function ReplayBuffer.sample at 0x7fe98c274ea0>", "_get_samples": "<function ReplayBuffer._get_samples at 0x7fe98c274f40>", "_maybe_cast_dtype": "<staticmethod(<function ReplayBuffer._maybe_cast_dtype at 0x7fe98c274fe0>)>", "__abstractmethods__": "frozenset()", "_abc_impl": "<_abc._abc_data object at 0x7fe98c466c80>"}, "replay_buffer_kwargs": {}, "train_freq": {":type:": "<class 'stable_baselines3.common.type_aliases.TrainFreq'>", ":serialized:": "gAWVYQAAAAAAAACMJXN0YWJsZV9iYXNlbGluZXMzLmNvbW1vbi50eXBlX2FsaWFzZXOUjAlUcmFpbkZyZXGUk5RLAWgAjBJUcmFpbkZyZXF1ZW5jeVVuaXSUk5SMBHN0ZXCUhZRSlIaUgZQu"}, "use_sde_at_warmup": false, "target_entropy": -6.0, "ent_coef": "auto", "target_update_interval": 1, "observation_space": {":type:": "<class 'gymnasium.spaces.box.Box'>", ":serialized:": "gAWVsQIAAAAAAACMFGd5bW5hc2l1bS5zcGFjZXMuYm94lIwDQm94lJOUKYGUfZQojAVkdHlwZZSMBW51bXB5lIwFZHR5cGWUk5SMAmY4lImIh5RSlChLA4wBPJROTk5K/////0r/////SwB0lGKMBl9zaGFwZZRLEYWUjANsb3eUjBJudW1weS5jb3JlLm51bWVyaWOUjAtfZnJvbWJ1ZmZlcpSTlCiWiAAAAAAAAAAAAAAAAADw/wAAAAAAAPD/AAAAAAAA8P8AAAAAAADw/wAAAAAAAPD/AAAAAAAA8P8AAAAAAADw/wAAAAAAAPD/AAAAAAAA8P8AAAAAAADw/wAAAAAAAPD/AAAAAAAA8P8AAAAAAADw/wAAAAAAAPD/AAAAAAAA8P8AAAAAAADw/wAAAAAAAPD/lGgLSxGFlIwBQ5R0lFKUjA1ib3VuZGVkX2JlbG93lGgTKJYRAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAlGgIjAJiMZSJiIeUUpQoSwOMAXyUTk5OSv////9K/////0sAdJRiSxGFlGgWdJRSlIwEaGlnaJRoEyiWiAAAAAAAAAAAAAAAAADwfwAAAAAAAPB/AAAAAAAA8H8AAAAAAADwfwAAAAAAAPB/AAAAAAAA8H8AAAAAAADwfwAAAAAAAPB/AAAAAAAA8H8AAAAAAADwfwAAAAAAAPB/AAAAAAAA8H8AAAAAAADwfwAAAAAAAPB/AAAAAAAA8H8AAAAAAADwfwAAAAAAAPB/lGgLSxGFlGgWdJRSlIwNYm91bmRlZF9hYm92ZZRoEyiWEQAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAJRoHUsRhZRoFnSUUpSMCGxvd19yZXBylIwELWluZpSMCWhpZ2hfcmVwcpSMA2luZpSMCl9ucF9yYW5kb22UTnViLg==", "dtype": "float64", "_shape": [17], "low": "[-inf -inf -inf -inf -inf -inf -inf -inf -inf -inf -inf -inf -inf -inf\n -inf -inf -inf]", "bounded_below": "[False False False False False False False False False False False 
False\n False False False False False]", "high": "[inf inf inf inf inf inf inf inf inf inf inf inf inf inf inf inf inf]", "bounded_above": "[False False False False False False False False False False False False\n False False False False False]", "low_repr": "-inf", "high_repr": "inf", "_np_random": null}, "action_space": {":type:": "<class 'gymnasium.spaces.box.Box'>", ":serialized:": "gAWVfgIAAAAAAACMFGd5bW5hc2l1bS5zcGFjZXMuYm94lIwDQm94lJOUKYGUfZQojAVkdHlwZZSMBW51bXB5lIwFZHR5cGWUk5SMAmY0lImIh5RSlChLA4wBPJROTk5K/////0r/////SwB0lGKMBl9zaGFwZZRLBoWUjANsb3eUjBJudW1weS5jb3JlLm51bWVyaWOUjAtfZnJvbWJ1ZmZlcpSTlCiWGAAAAAAAAAAAAIC/AACAvwAAgL8AAIC/AACAvwAAgL+UaAtLBoWUjAFDlHSUUpSMDWJvdW5kZWRfYmVsb3eUaBMolgYAAAAAAAAAAQEBAQEBlGgIjAJiMZSJiIeUUpQoSwOMAXyUTk5OSv////9K/////0sAdJRiSwaFlGgWdJRSlIwEaGlnaJRoEyiWGAAAAAAAAAAAAIA/AACAPwAAgD8AAIA/AACAPwAAgD+UaAtLBoWUaBZ0lFKUjA1ib3VuZGVkX2Fib3ZllGgTKJYGAAAAAAAAAAEBAQEBAZRoHUsGhZRoFnSUUpSMCGxvd19yZXBylIwELTEuMJSMCWhpZ2hfcmVwcpSMAzEuMJSMCl9ucF9yYW5kb22UjBRudW1weS5yYW5kb20uX3BpY2tsZZSMEF9fZ2VuZXJhdG9yX2N0b3KUk5SMBVBDRzY0lGgyjBRfX2JpdF9nZW5lcmF0b3JfY3RvcpSTlIaUUpR9lCiMDWJpdF9nZW5lcmF0b3KUjAVQQ0c2NJSMBXN0YXRllH2UKGg9ihDjYZWmt15YCS1Fllk0taEajANpbmOUihCpc3hEvDOBWIIa9zrb2o1BdYwKaGFzX3VpbnQzMpRLAIwIdWludGVnZXKUSwB1YnViLg==", "dtype": "float32", "_shape": [6], "low": "[-1. -1. -1. -1. -1. -1.]", "bounded_below": "[ True True True True True True]", "high": "[1. 1. 1. 1. 1. 
1.]", "bounded_above": "[ True True True True True True]", "low_repr": "-1.0", "high_repr": "1.0", "_np_random": "Generator(PCG64)"}, "n_envs": 5, "lr_schedule": {":type:": "<class 'function'>", ":serialized:": "gAWVHgQAAAAAAACMF2Nsb3VkcGlja2xlLmNsb3VkcGlja2xllIwOX21ha2VfZnVuY3Rpb26Uk5QoaACMDV9idWlsdGluX3R5cGWUk5SMCENvZGVUeXBllIWUUpQoSwFLAEsASwFLBUsTQyaVAZcAdAEAAAAAAAAAAAIAiQF8AKsBAAAAAAAAqwEAAAAAAABTAJROhZSMBWZsb2F0lIWUjBJwcm9ncmVzc19yZW1haW5pbmeUhZSMVC9ob21lL21hc3Rlci1hbmRyZWFzL2dlbl9kYXRhc2V0L3N0YWJsZS1iYXNlbGluZXMzL3N0YWJsZV9iYXNlbGluZXMzL2NvbW1vbi91dGlscy5weZSMCDxsYW1iZGE+lIwhZ2V0X3NjaGVkdWxlX2ZuLjxsb2NhbHM+LjxsYW1iZGE+lEthQxD4gACkZalO0DtN0yxO1CZPlEMAlIwOdmFsdWVfc2NoZWR1bGWUhZQpdJRSlH2UKIwLX19wYWNrYWdlX1+UjBhzdGFibGVfYmFzZWxpbmVzMy5jb21tb26UjAhfX25hbWVfX5SMHnN0YWJsZV9iYXNlbGluZXMzLmNvbW1vbi51dGlsc5SMCF9fZmlsZV9flIxUL2hvbWUvbWFzdGVyLWFuZHJlYXMvZ2VuX2RhdGFzZXQvc3RhYmxlLWJhc2VsaW5lczMvc3RhYmxlX2Jhc2VsaW5lczMvY29tbW9uL3V0aWxzLnB5lHVOTmgAjBBfbWFrZV9lbXB0eV9jZWxslJOUKVKUhZR0lFKUaACMEl9mdW5jdGlvbl9zZXRzdGF0ZZSTlGgjfZR9lChoGowIPGxhbWJkYT6UjAxfX3F1YWxuYW1lX1+UjCFnZXRfc2NoZWR1bGVfZm4uPGxvY2Fscz4uPGxhbWJkYT6UjA9fX2Fubm90YXRpb25zX1+UfZSMDl9fa3dkZWZhdWx0c19flE6MDF9fZGVmYXVsdHNfX5ROjApfX21vZHVsZV9flGgbjAdfX2RvY19flE6MC19fY2xvc3VyZV9flGgAjApfbWFrZV9jZWxslJOUaAIoaAcoSwFLAEsASwFLAUsTQwiVAZcAiQFTAJRoCSmMAV+UhZRoDowEZnVuY5SMGWNvbnN0YW50X2ZuLjxsb2NhbHM+LmZ1bmOUS4VDCPiAANgPEogKlGgSjAN2YWyUhZQpdJRSlGgXTk5oHylSlIWUdJRSlGglaEF9lH2UKGgajARmdW5jlGgpjBljb25zdGFudF9mbi48bG9jYWxzPi5mdW5jlGgrfZRoLU5oLk5oL2gbaDBOaDFoM0c/M6kqMFUyYYWUUpSFlIwXX2Nsb3VkcGlja2xlX3N1Ym1vZHVsZXOUXZSMC19fZ2xvYmFsc19flH2UdYaUhlIwhZRSlIWUaEpdlGhMfZR1hpSGUjAu"}, "batch_norm_stats": [], "batch_norm_stats_target": [], "system_info": {"OS": "Linux-6.6.59-1-MANJARO-x86_64-with-glibc2.40 # 1 SMP PREEMPT_DYNAMIC Fri Nov 1 05:33:52 UTC 2024", "Python": "3.12.7", "Stable-Baselines3": "2.4.0a10", "PyTorch": "2.4.1+cu121", "GPU Enabled": "True", "Numpy": "1.26.4", "Cloudpickle": "3.1.0", "Gymnasium": "1.0.0"}}
 
+ {"policy_class": {":type:": "<class 'abc.ABCMeta'>", ":serialized:": "gAWVMAAAAAAAAACMHnN0YWJsZV9iYXNlbGluZXMzLnNhYy5wb2xpY2llc5SMCVNBQ1BvbGljeZSTlC4=", "__module__": "stable_baselines3.sac.policies", "__annotations__": "{'actor': <class 'stable_baselines3.sac.policies.Actor'>, 'critic': <class 'stable_baselines3.common.policies.ContinuousCritic'>, 'critic_target': <class 'stable_baselines3.common.policies.ContinuousCritic'>}", "__doc__": "\n Policy class (with both actor and critic) for SAC.\n\n :param observation_space: Observation space\n :param action_space: Action space\n :param lr_schedule: Learning rate schedule (could be constant)\n :param net_arch: The specification of the policy and value networks.\n :param activation_fn: Activation function\n :param use_sde: Whether to use State Dependent Exploration or not\n :param log_std_init: Initial value for the log standard deviation\n :param use_expln: Use ``expln()`` function instead of ``exp()`` when using gSDE to ensure\n a positive standard deviation (cf paper). It allows to keep variance\n above zero and prevent it from growing too fast. 
In practice, ``exp()`` is usually enough.\n :param clip_mean: Clip the mean output when using gSDE to avoid numerical instability.\n :param features_extractor_class: Features extractor to use.\n :param features_extractor_kwargs: Keyword arguments\n to pass to the features extractor.\n :param normalize_images: Whether to normalize images or not,\n dividing by 255.0 (True by default)\n :param optimizer_class: The optimizer to use,\n ``th.optim.Adam`` by default\n :param optimizer_kwargs: Additional keyword arguments,\n excluding the learning rate, to pass to the optimizer\n :param n_critics: Number of critic networks to create.\n :param share_features_extractor: Whether to share or not the features extractor\n between the actor and the critic (this saves computation time)\n ", "__init__": "<function SACPolicy.__init__ at 0x7f7385c0ce00>", "_build": "<function SACPolicy._build at 0x7f7385c0d3a0>", "_get_constructor_parameters": "<function SACPolicy._get_constructor_parameters at 0x7f7385c0d440>", "reset_noise": "<function SACPolicy.reset_noise at 0x7f7385c0d4e0>", "make_actor": "<function SACPolicy.make_actor at 0x7f7385c0d580>", "make_critic": "<function SACPolicy.make_critic at 0x7f7385c0d620>", "forward": "<function SACPolicy.forward at 0x7f7385c0d6c0>", "_predict": "<function SACPolicy._predict at 0x7f7385c0d760>", "set_training_mode": "<function SACPolicy.set_training_mode at 0x7f7385c0d800>", "__abstractmethods__": "frozenset()", "_abc_impl": "<_abc._abc_data object at 0x7f7385c14980>"}, "verbose": 0, "policy_kwargs": {"use_sde": false}, "num_timesteps": 5700000, "_total_timesteps": 6000000, "_num_timesteps_at_start": 0, "seed": 0, "action_noise": null, "start_time": 1736092312186653182, "learning_rate": 0.0003, "tensorboard_log": "runs/0", "_last_obs": {":type:": "<class 'numpy.ndarray'>", ":serialized:": 
"gAWVHQMAAAAAAACMEm51bXB5LmNvcmUubnVtZXJpY5SMC19mcm9tYnVmZmVylJOUKJaoAgAAAAAAANLMTdozCfI/satoOoJG5L/SVamXUwHuvw3X8FCIFI0/uFYtKR0f6j/qxHl1Ej+SPxffvjhb0/S/oq/Hsb0P6j+kN+0KVu3uPx3d0qRbkcS/hCy50gZDxr8Cr4lFcu7zv1xTbqcGsq6/yLETqkOeyr+QSI159FPSv/5/TrSfKfa/HpksfFEsez8zqr8Gg1fzP47RUznpF4o/ZW0w9ot/27/0SdYlPxqaP/GDfy5/XNG/izaSumEjlT9mLiE7Vcryv0PIOjvtwOK/qr6tBSnSE0Dhro1KoQDcP6kniINh6QTAc77EtxfTHcBmcuqvraLNv9oFTcdpYQjA3hVuttWI6b+c8jiXLgILwEAgON4zfCNAV14V5QSc8z+WfqFUEbHJP44C0FR5Fc+/9Z36I49biD+jFn9v2RvrPyXNwXoAS5s/PTz/PC/I5b+jfj+ZU6jsvwQQn4gJUxNA1RLG9tKD8T8nDOnH0CcDQN1GKoXK1B5AgfPEKSiFxD8SLjex1Wzhv+MH871Se96/6KRKVT4XG0AAAAAAAAAkwCVUkGvtC/M/mo3RJb/+wb+/anTh22ajP16fYNuiSY4/WDLHa/nr6T/rywjqcx/Mv52s0Sqjk++/FxSGRqe+6r8PnSR7B3wZQKDMjJDxON2/9DxhwX1U/T9E48LvkSaQv83dji1F1Oy/THsx9wCc6L+J4OOwlwURQBftjJPRkh7AwzcJ+GzgIUDsa1TG5hj1P2yCF2SWz9E/OTJtNRRn07/oZ1g5lM+iP0RmduFqvuo/Th8iPGQ6kz+lfHA11l7nv5BR6rPUauq/Fqgiit96FEAjbfyHei/BP9KZ0pCOjQFARekgkcxlHEDOCvHlKBXbv8jVbmLK89+/diF+z0ferL+ClB0KYxsFwJzz5tNVr9k/lIwFbnVtcHmUjAVkdHlwZZSTlIwCZjiUiYiHlFKUKEsDjAE8lE5OTkr/////Sv////9LAHSUYksFSxGGlIwBQ5R0lFKULg=="}, "_last_episode_starts": {":type:": "<class 'numpy.ndarray'>", ":serialized:": "gAWVeAAAAAAAAACMEm51bXB5LmNvcmUubnVtZXJpY5SMC19mcm9tYnVmZmVylJOUKJYFAAAAAAAAAAEBAQEBlIwFbnVtcHmUjAVkdHlwZZSTlIwCYjGUiYiHlFKUKEsDjAF8lE5OTkr/////Sv////9LAHSUYksFhZSMAUOUdJRSlC4="}, "_last_original_obs": {":type:": "<class 'numpy.ndarray'>", ":serialized:": 
"gAWVHQMAAAAAAACMEm51bXB5LmNvcmUubnVtZXJpY5SMC19mcm9tYnVmZmVylJOUKJaoAgAAAAAAAM0RQnkXC/I/W8cSOHiC5L/w1FXvMfvtv9iLRBA/mIw/ZmRWFFso6j/KxEucHRuBP3PA6yL0kPS//MzM+MkO6j85gaR45V7yP8xzKj4NCqc/jYxARPoWAEB6xBviDM/wPwAiIzh5bsI/CJI4cJxvqr8N0/j6fVcFQDODz8TMdgXAbEfLFqlJmT9+ySNPbErzP0i4Uvid3qE/NOho1KWB17/t+8o3f+ibP200U0avG9C/ninVW3XdnT9MMIq5ilfyv0CJ39IcSOW/8rEN5zLjE0DJzLMqLSDXP2n8y7JjswfAp66z261TIMBdjFyChQXJv/GwF/+9agDA0PYB0eQZ9r91CHdjZe4MwAAAAAAAACRAzf3n4nhz8z/GcLYB00XHP3xlgpgZhdO/VONUmT7BhT9okJg/2yDrP1/ChMurPp8/hO1aV9mv57/9ywol1yzmv7I4HD0djxNAGLQrBi6e9D9EwC/KwtEDQKwPqFaFyh9AhcfUM3XQxD8swwrWMcThPzFTfkbeC+C/dDhe9SGOHEAAAAAAAAAkwLswlF0vHfM/i+jBh0Grw78rQW2988KjP46rDgckuZA/BRoQxZb26T9x3HuutjzQv3V3lpWxg+2/9aAnZ/OR7L+p4t890DMZQGQhHOFW1t+/tNV5CcAy9z/pNM9/s+7Hv6rdyaWair2/P5GJCDm5xb+DA19TjN0QQHyCfpgDZSDA2ewFH7jjE0Cog3yH6RH1P0z0Tu4we9A/aiJcCHJF1787VK+XCXukP+Z8/l4j4uo//tn28Ddkkz8f17ngluPmv4mXB+lSguq/lSLU3YhOFUAcAmfetdDSP/4D2hIbGwhA7yZVL+QKIEB9+XNGWAXYvzOLJWe+0eK/3NzWyyfEmz8yIeagndrxvz04s/mjl9I/lIwFbnVtcHmUjAVkdHlwZZSTlIwCZjiUiYiHlFKUKEsDjAE8lE5OTkr/////Sv////9LAHSUYksFSxGGlIwBQ5R0lFKULg=="}, "_episode_num": 8819, "use_sde": false, "sde_sample_freq": -1, "_current_progress_remaining": 0.05000083333333338, "_stats_window_size": 100, "ep_info_buffer": {":type:": "<class 'collections.deque'>", ":serialized:": 
"gAWVRAwAAAAAAACMC2NvbGxlY3Rpb25zlIwFZGVxdWWUk5QpS2SGlFKUKH2UKIwBcpRHQLapVuB+WnmMAWyUTegDjAF0lEdA5FU+5KWcBnV9lChoBkdAtxCUG/vfCWgHTegDaAhHQORVSDBTGYN1fZQoaAZHQLbZBz0HyEtoB03oA2gIR0DkVVA83dbgdX2UKGgGR0C2bQzKs+3ZaAdN6ANoCEdA5FnNIOQQtnV9lChoBkdAtyOWf9P1tmgHTegDaAhHQORZ1M5sCT51fZQoaAZHQLbMiDAaef9oB03oA2gIR0DkWlZ8lXzUdX2UKGgGR0C3Z5DTz/ZNaAdN6ANoCEdA5FpewCbMHXV9lChoBkdAt1hkDp1RtWgHTegDaAhHQORaZfck+ot1fZQoaAZHQLdRio5ggHNoB03oA2gIR0DkXx63IdU9dX2UKGgGR0C3QrEUO/cnaAdN6ANoCEdA5F8lAYYR/XV9lChoBkdAtwAMFGG21GgHTegDaAhHQORfo96HCXR1fZQoaAZHQLbWaMK1G9ZoB03oA2gIR0DkX6vt2LYPdX2UKGgGR0C3cHFc+qzaaAdN6ANoCEdA5F+zj6WPcXV9lChoBkdAtye0wudwvWgHTegDaAhHQORkL6SmqHZ1fZQoaAZHQLaRG/9pAUtoB03oA2gIR0DkZDYZFXq8dX2UKGgGR0C21YAfuCwsaAdN6ANoCEdA5GSz7EHdGnV9lChoBkdAty5QCeVcEGgHTegDaAhHQORkvD5TIeZ1fZQoaAZHQLc3/YcNpdtoB03oA2gIR0DkZMOH8jzJdX2UKGgGR0C3IK8TnJT3aAdN6ANoCEdA5GkJ8xKxs3V9lChoBkdAtwjjp5eJHmgHTegDaAhHQORpEMasIVx1fZQoaAZHQLZ8YRkVerxoB03oA2gIR0DkaZJI4lyBdX2UKGgGR0C3OVNW6shgaAdN6ANoCEdA5GmbF6RhdHV9lChoBkdAtxNG8+Roy2gHTegDaAhHQORpos0WM0h1fZQoaAZHQLegZS9/SYxoB03oA2gIR0DkbfiKzAvddX2UKGgGR0C3Gpp1ie/YaAdN6ANoCEdA5G3+fffoBHV9lChoBkdAtuP5Q/HHWGgHTegDaAhHQORudav5gw51fZQoaAZHQLazgRwZOzpoB03oA2gIR0Dkbn7fZ26kdX2UKGgGR0C3rSQc5sCUaAdN6ANoCEdA5G6GIqbz9XV9lChoBkdAtz1F29tdiWgHTegDaAhHQORzD4CU5dZ1fZQoaAZHQLcw0QYk3S9oB03oA2gIR0DkcxbfVI7OdX2UKGgGR0C3J6r/GVAzaAdN6ANoCEdA5HOk7iZOSHV9lChoBkdAtv4yckMTe2gHTegDaAhHQORzrY6jnFJ1fZQoaAZHQLanZ3/givBoB03oA2gIR0Dkc7XvAoG6dX2UKGgGR0C2qESr5qM4aAdN6ANoCEdA5HftjTa0yHV9lChoBkdAtysMjqv/zmgHTegDaAhHQOR39AL1EmZ1fZQoaAZHQLcFRcpLEk1oB03oA2gIR0DkeHLgogFHdX2UKGgGR0C1sxR/3FkyaAdN6ANoCEdA5Hh8Qu/UOXV9lChoBkdAtwO/HcUM5WgHTegDaAhHQOR4g8hvBJt1fZQoaAZHQJVBm7EpAlhoB00sAWgIR0Dke7R4X40udX2UKGgGR0C2+wAGfPHDaAdN6ANoCEdA5HyKxHoX9HV9lChoBkdAtuKi+UQkHGgHTegDaAhHQOR8kS9RJmN1fZQoaAZHQLa9qCf6Gg1oB03oA2gIR0DkfRTUF0PpdX2UKGgGR0C2fi2V7hNuaAdN6ANoCEdA5H0cr0aqCHV9lChoBkdAoa9DpmmLtWgHTckBaAhHQOR9f5eXzDp1fZQoaAZHQLbSaocaOxVoB03oA2gIR0Dkf9iOe8PGdX2UKGgGR0C21WAT7EYPaAdN6ANoCEdA5ICqNWU8m3V9lChoBkdAt2AOtyPuHGgHTegDaAhHQOS
BN1yvLYB1fZQoaAZHQLaoV3M6ikBoB03oA2gIR0DkgUBoFmnPdX2UKGgGR0C22pJ0KZ2IaAdN6ANoCEdA5IGvf8VHnXV9lChoBkdAtzTYAWBSUGgHTegDaAhHQOSFJEi8nNR1fZQoaAZHQLePsR1X/5toB03oA2gIR0DkhgXar3j/dX2UKGgGR0C3fRUr9VFQaAdN6ANoCEdA5IaQj2i+L3V9lChoBkdAt38vHp8neGgHTegDaAhHQOSGmidxyXF1fZQoaAZHQLdZyzMibDxoB03oA2gIR0DkhwGeT3ZgdX2UKGgGR0C3IvCO3lS1aAdN6ANoCEdA5Ipvae5Fw3V9lChoBkdAtvlbDtPYWmgHTegDaAhHQOSLT2OIZZV1fZQoaAZHQLc0P1c+qzZoB03oA2gIR0Dki+CD7IkrdX2UKGgGR0C25IzhHbypaAdN6ANoCEdA5Ivp/SQYDXV9lChoBkdAt1nQw9JSSGgHTegDaAhHQOSMWmjM3ZR1fZQoaAZHQLeVZIsRQJpoB03oA2gIR0Dkj5TILgGbdX2UKGgGR0C2+4nY+Sr6aAdN6ANoCEdA5JCF5Nwiq3V9lChoBkdAtwWTyxzJZGgHTegDaAhHQOSRFP1anrJ1fZQoaAZHQLcnUcvduYRoB03oA2gIR0DkkR0QaJhwdX2UKGgGR0C3NmwT/Q0GaAdN6ANoCEdA5JGDCcwxnHV9lChoBkdAt4vcNCqp+GgHTegDaAhHQOSU9SvX9R91fZQoaAZHQLc9HGJemeloB03oA2gIR0Dklb1oC+10dX2UKGgGR0C3smYgieNDaAdN6ANoCEdA5JZAXcpLEnV9lChoBkdAt466/zreImgHTegDaAhHQOSWSS+i8Fp1fZQoaAZHQLakeB4D9wZoB03oA2gIR0Dklqa8lolEdX2UKGgGR0C24/HOnl4kaAdN6ANoCEdA5JpJOz6acHV9lChoBkdAt1bLeXRgJGgHTegDaAhHQOSbIIMKCxx1fZQoaAZHQLIEeee4Cp5oB00RA2gIR0DkmzXdfLLZdX2UKGgGR0C3S8YAfdRBaAdN6ANoCEdA5JumjaXa8HV9lChoBkdAt4k0vnKW9mgHTegDaAhHQOScBY3eenR1fZQoaAZHQLcEeCQLeANoB03oA2gIR0Dkn56QCjk/dX2UKGgGR0C2+/RJVbRnaAdN6ANoCEdA5KCIE/B3zXV9lChoBkdAtr6G+ajN6mgHTegDaAhHQOSgoU4//vR1fZQoaAZHQLadBZIxxkxoB03oA2gIR0DkoSPCN0eVdX2UKGgGR0CvLKpzcRDkaAdNzwJoCEdA5KE1cw5/9nV9lChoBkdAtwFQKw6hg2gHTegDaAhHQOShi32RJVd1fZQoaAZHQLd4dBHkLhJoB03oA2gIR0DkpdrLIxQBdX2UKGgGR0C3HhLc0tROaAdN6ANoCEdA5KXyJ40Mw3V9lChoBkdAtxlvXYlIE2gHTegDaAhHQOSmcUTYdyV1fZQoaAZHQLZKWlt0mt1oB03oA2gIR0DkpoQDLbHqdX2UKGgGR0C2oxY9xIataAdN6ANoCEdA5KbdcsDnvHV9lChoBkdAtuhtYoy9EmgHTegDaAhHQOSrchlpXZJ1fZQoaAZHQLcjEUuctoVoB03oA2gIR0Dkq4g2oegddX2UKGgGR0C2PEjsUqQSaAdN6ANoCEdA5Kv5j0+TvHV9lChoBkdAtp+g+KTB7GgHTegDaAhHQOSsCk0tRN11fZQoaAZHQLfifZjQRf5oB03oA2gIR0DkrF8sDnvEdX2UKGgGR0C3eEs8gZCOaAdN6ANoCEdA5LCkq94/vHV9lChoBkdAt5RorDqGDmgHTegDaAhHQOSwvAzJp351fZQoaAZHQLcqOtY0VJtoB03oA2gIR0DksTDE2HcldX2UKGgGR0C3LToIrvsraAdN6ANoCEdA5LFDmQCCBnV9lChoBkdAt7DQx46fa2gHTegDaAhHQOSxnhc7heh1fZQoaAZ
HQLc2yCF9KEpoB03oA2gIR0DktdL/b0vodX2UKGgGR0C2iKWW6bvxaAdN6ANoCEdA5LXp/7SApnV9lChoBkdAt0US2a2F4GgHTegDaAhHQOS2ZoKpkwx1fZQoaAZHQLdIEOjZcs1oB03oA2gIR0Dktnq4S6DodX2UKGgGR0C2HOuJUHY6aAdN6ANoCEdA5LbTUMgEEHVlLg=="}, "ep_success_buffer": {":type:": "<class 'collections.deque'>", ":serialized:": "gAWVIAAAAAAAAACMC2NvbGxlY3Rpb25zlIwFZGVxdWWUk5QpS2SGlFKULg=="}, "_n_updates": 1137999, "buffer_size": 1000000, "batch_size": 256, "learning_starts": 10000, "tau": 0.005, "gamma": 0.99, "gradient_steps": 1, "optimize_memory_usage": false, "replay_buffer_class": {":type:": "<class 'abc.ABCMeta'>", ":serialized:": "gAWVNQAAAAAAAACMIHN0YWJsZV9iYXNlbGluZXMzLmNvbW1vbi5idWZmZXJzlIwMUmVwbGF5QnVmZmVylJOULg==", "__module__": "stable_baselines3.common.buffers", "__annotations__": "{'observations': <class 'numpy.ndarray'>, 'next_observations': <class 'numpy.ndarray'>, 'actions': <class 'numpy.ndarray'>, 'rewards': <class 'numpy.ndarray'>, 'dones': <class 'numpy.ndarray'>, 'timeouts': <class 'numpy.ndarray'>}", "__doc__": "\n Replay buffer used in off-policy algorithms like SAC/TD3.\n\n :param buffer_size: Max number of element in the buffer\n :param observation_space: Observation space\n :param action_space: Action space\n :param device: PyTorch device\n :param n_envs: Number of parallel environments\n :param optimize_memory_usage: Enable a memory efficient variant\n of the replay buffer which reduces by almost a factor two the memory used,\n at a cost of more complexity.\n See https://github.com/DLR-RM/stable-baselines3/issues/37#issuecomment-637501195\n and https://github.com/DLR-RM/stable-baselines3/pull/28#issuecomment-637559274\n Cannot be used in combination with handle_timeout_termination.\n :param handle_timeout_termination: Handle timeout termination (due to timelimit)\n separately and treat the task as infinite horizon task.\n https://github.com/DLR-RM/stable-baselines3/issues/284\n ", "__init__": "<function ReplayBuffer.__init__ at 0x7f7390ae4d60>", "add": "<function 
ReplayBuffer.add at 0x7f7390ae4ea0>", "sample": "<function ReplayBuffer.sample at 0x7f7390ae4f40>", "_get_samples": "<function ReplayBuffer._get_samples at 0x7f7390ae4fe0>", "_maybe_cast_dtype": "<staticmethod(<function ReplayBuffer._maybe_cast_dtype at 0x7f7390ae5080>)>", "__abstractmethods__": "frozenset()", "_abc_impl": "<_abc._abc_data object at 0x7f7390ade540>"}, "replay_buffer_kwargs": {}, "train_freq": {":type:": "<class 'stable_baselines3.common.type_aliases.TrainFreq'>", ":serialized:": "gAWVYQAAAAAAAACMJXN0YWJsZV9iYXNlbGluZXMzLmNvbW1vbi50eXBlX2FsaWFzZXOUjAlUcmFpbkZyZXGUk5RLAWgAjBJUcmFpbkZyZXF1ZW5jeVVuaXSUk5SMBHN0ZXCUhZRSlIaUgZQu"}, "use_sde_at_warmup": false, "target_entropy": -6.0, "ent_coef": "auto", "target_update_interval": 1, "observation_space": {":type:": "<class 'gymnasium.spaces.box.Box'>", ":serialized:": "gAWVsQIAAAAAAACMFGd5bW5hc2l1bS5zcGFjZXMuYm94lIwDQm94lJOUKYGUfZQojAVkdHlwZZSMBW51bXB5lIwFZHR5cGWUk5SMAmY4lImIh5RSlChLA4wBPJROTk5K/////0r/////SwB0lGKMBl9zaGFwZZRLEYWUjANsb3eUjBJudW1weS5jb3JlLm51bWVyaWOUjAtfZnJvbWJ1ZmZlcpSTlCiWiAAAAAAAAAAAAAAAAADw/wAAAAAAAPD/AAAAAAAA8P8AAAAAAADw/wAAAAAAAPD/AAAAAAAA8P8AAAAAAADw/wAAAAAAAPD/AAAAAAAA8P8AAAAAAADw/wAAAAAAAPD/AAAAAAAA8P8AAAAAAADw/wAAAAAAAPD/AAAAAAAA8P8AAAAAAADw/wAAAAAAAPD/lGgLSxGFlIwBQ5R0lFKUjA1ib3VuZGVkX2JlbG93lGgTKJYRAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAlGgIjAJiMZSJiIeUUpQoSwOMAXyUTk5OSv////9K/////0sAdJRiSxGFlGgWdJRSlIwEaGlnaJRoEyiWiAAAAAAAAAAAAAAAAADwfwAAAAAAAPB/AAAAAAAA8H8AAAAAAADwfwAAAAAAAPB/AAAAAAAA8H8AAAAAAADwfwAAAAAAAPB/AAAAAAAA8H8AAAAAAADwfwAAAAAAAPB/AAAAAAAA8H8AAAAAAADwfwAAAAAAAPB/AAAAAAAA8H8AAAAAAADwfwAAAAAAAPB/lGgLSxGFlGgWdJRSlIwNYm91bmRlZF9hYm92ZZRoEyiWEQAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAJRoHUsRhZRoFnSUUpSMCGxvd19yZXBylIwELWluZpSMCWhpZ2hfcmVwcpSMA2luZpSMCl9ucF9yYW5kb22UTnViLg==", "dtype": "float64", "_shape": [17], "low": "[-inf -inf -inf -inf -inf -inf -inf -inf -inf -inf -inf -inf -inf -inf\n -inf -inf -inf]", "bounded_below": "[False False False False False False False False False False False 
False\n False False False False False]", "high": "[inf inf inf inf inf inf inf inf inf inf inf inf inf inf inf inf inf]", "bounded_above": "[False False False False False False False False False False False False\n False False False False False]", "low_repr": "-inf", "high_repr": "inf", "_np_random": null}, "action_space": {":type:": "<class 'gymnasium.spaces.box.Box'>", ":serialized:": "gAWVfgIAAAAAAACMFGd5bW5hc2l1bS5zcGFjZXMuYm94lIwDQm94lJOUKYGUfZQojAVkdHlwZZSMBW51bXB5lIwFZHR5cGWUk5SMAmY0lImIh5RSlChLA4wBPJROTk5K/////0r/////SwB0lGKMBl9zaGFwZZRLBoWUjANsb3eUjBJudW1weS5jb3JlLm51bWVyaWOUjAtfZnJvbWJ1ZmZlcpSTlCiWGAAAAAAAAAAAAIC/AACAvwAAgL8AAIC/AACAvwAAgL+UaAtLBoWUjAFDlHSUUpSMDWJvdW5kZWRfYmVsb3eUaBMolgYAAAAAAAAAAQEBAQEBlGgIjAJiMZSJiIeUUpQoSwOMAXyUTk5OSv////9K/////0sAdJRiSwaFlGgWdJRSlIwEaGlnaJRoEyiWGAAAAAAAAAAAAIA/AACAPwAAgD8AAIA/AACAPwAAgD+UaAtLBoWUaBZ0lFKUjA1ib3VuZGVkX2Fib3ZllGgTKJYGAAAAAAAAAAEBAQEBAZRoHUsGhZRoFnSUUpSMCGxvd19yZXBylIwELTEuMJSMCWhpZ2hfcmVwcpSMAzEuMJSMCl9ucF9yYW5kb22UjBRudW1weS5yYW5kb20uX3BpY2tsZZSMEF9fZ2VuZXJhdG9yX2N0b3KUk5SMBVBDRzY0lGgyjBRfX2JpdF9nZW5lcmF0b3JfY3RvcpSTlIaUUpR9lCiMDWJpdF9nZW5lcmF0b3KUjAVQQ0c2NJSMBXN0YXRllH2UKGg9ihDjYZWmt15YCS1Fllk0taEajANpbmOUihCpc3hEvDOBWIIa9zrb2o1BdYwKaGFzX3VpbnQzMpRLAIwIdWludGVnZXKUSwB1YnViLg==", "dtype": "float32", "_shape": [6], "low": "[-1. -1. -1. -1. -1. -1.]", "bounded_below": "[ True True True True True True]", "high": "[1. 1. 1. 1. 1. 
1.]", "bounded_above": "[ True True True True True True]", "low_repr": "-1.0", "high_repr": "1.0", "_np_random": "Generator(PCG64)"}, "n_envs": 5, "lr_schedule": {":type:": "<class 'function'>", ":serialized:": "gAWVRgQAAAAAAACMF2Nsb3VkcGlja2xlLmNsb3VkcGlja2xllIwOX21ha2VfZnVuY3Rpb26Uk5QoaACMDV9idWlsdGluX3R5cGWUk5SMCENvZGVUeXBllIWUUpQoSwFLAEsASwFLBUsTQyaVAZcAdAEAAAAAAAAAAAIAiQF8AKsBAAAAAAAAqwEAAAAAAABTAJROhZSMBWZsb2F0lIWUjBJwcm9ncmVzc19yZW1haW5pbmeUhZSMaC9ob21lL21hc3Rlci1hbmRyZWFzL2dlbl9kYXRhc2V0L3Rlc3RfZW52L2xpYi9weXRob24zLjEyL3NpdGUtcGFja2FnZXMvc3RhYmxlX2Jhc2VsaW5lczMvY29tbW9uL3V0aWxzLnB5lIwIPGxhbWJkYT6UjCFnZXRfc2NoZWR1bGVfZm4uPGxvY2Fscz4uPGxhbWJkYT6US2FDEPiAAKRlqU7QO03TLE7UJk+UQwCUjA52YWx1ZV9zY2hlZHVsZZSFlCl0lFKUfZQojAtfX3BhY2thZ2VfX5SMGHN0YWJsZV9iYXNlbGluZXMzLmNvbW1vbpSMCF9fbmFtZV9flIwec3RhYmxlX2Jhc2VsaW5lczMuY29tbW9uLnV0aWxzlIwIX19maWxlX1+UjGgvaG9tZS9tYXN0ZXItYW5kcmVhcy9nZW5fZGF0YXNldC90ZXN0X2Vudi9saWIvcHl0aG9uMy4xMi9zaXRlLXBhY2thZ2VzL3N0YWJsZV9iYXNlbGluZXMzL2NvbW1vbi91dGlscy5weZR1Tk5oAIwQX21ha2VfZW1wdHlfY2VsbJSTlClSlIWUdJRSlGgAjBJfZnVuY3Rpb25fc2V0c3RhdGWUk5RoI32UfZQoaBqMCDxsYW1iZGE+lIwMX19xdWFsbmFtZV9flIwhZ2V0X3NjaGVkdWxlX2ZuLjxsb2NhbHM+LjxsYW1iZGE+lIwPX19hbm5vdGF0aW9uc19flH2UjA5fX2t3ZGVmYXVsdHNfX5ROjAxfX2RlZmF1bHRzX1+UTowKX19tb2R1bGVfX5RoG4wHX19kb2NfX5ROjAtfX2Nsb3N1cmVfX5RoAIwKX21ha2VfY2VsbJSTlGgCKGgHKEsBSwBLAEsBSwFLE0MIlQGXAIkBUwCUaAkpjAFflIWUaA6MBGZ1bmOUjBljb25zdGFudF9mbi48bG9jYWxzPi5mdW5jlEuFQwj4gADYDxKICpRoEowDdmFslIWUKXSUUpRoF05OaB8pUpSFlHSUUpRoJWhBfZR9lChoGowEZnVuY5RoKYwZY29uc3RhbnRfZm4uPGxvY2Fscz4uZnVuY5RoK32UaC1OaC5OaC9oG2gwTmgxaDNHPzOpKjBVMmGFlFKUhZSMF19jbG91ZHBpY2tsZV9zdWJtb2R1bGVzlF2UjAtfX2dsb2JhbHNfX5R9lHWGlIZSMIWUUpSFlGhKXZRoTH2UdYaUhlIwLg=="}, "batch_norm_stats": [], "batch_norm_stats_target": [], "system_info": {"OS": "Linux-6.1.125-1-MANJARO-x86_64-with-glibc2.40 # 1 SMP PREEMPT_DYNAMIC Fri Jan 17 15:04:03 UTC 2025", "Python": "3.12.8", "Stable-Baselines3": "2.4.1", "PyTorch": "2.5.1+cu124", "GPU Enabled": "True", "Numpy": "1.26.4", 
"Cloudpickle": "3.1.0", "Gymnasium": "1.0.0"}}
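The `target_entropy` of -6.0 recorded in this config is consistent with the common SAC heuristic of using the negative of the action dimension, given the 6-dimensional `action_space` listed above. A minimal sketch of that arithmetic follows; the helper name `auto_target_entropy` is hypothetical and not part of Stable-Baselines3, and whether this run actually used the automatic setting is an assumption.

```python
import math


def auto_target_entropy(action_shape):
    """Negative of the number of action dimensions (common SAC heuristic)."""
    return -float(math.prod(action_shape))


# "_shape": [6] is taken from the action_space entry in config.json.
target_entropy = auto_target_entropy([6])
print(target_entropy)
```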
results.json CHANGED
@@ -1 +1 @@
- {"mean_reward": 6154.4426466, "std_reward": 787.8395217227671, "is_deterministic": true, "n_eval_episodes": 10, "eval_datetime": "2024-11-13T06:56:17.497543"}
+ {"mean_reward": 6173.857775197, "std_reward": 197.8644936626215, "is_deterministic": true, "n_eval_episodes": 1000, "eval_datetime": "2025-01-26T14:40:57.638935"}
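The updated results can be turned into the `mean +/- std` string used in the README metadata, and the larger episode count (1000 instead of 10) can be summarised as a standard error of the mean. The sketch below assumes the evaluation episodes are independent; the variable names are illustrative only.

```python
import json
import math

# The "+" line of results.json from this commit.
results_line = (
    '{"mean_reward": 6173.857775197, "std_reward": 197.8644936626215, '
    '"is_deterministic": true, "n_eval_episodes": 1000, '
    '"eval_datetime": "2025-01-26T14:40:57.638935"}'
)
results = json.loads(results_line)

# Same "mean +/- std" format as the README metric value.
summary = f"{results['mean_reward']:.2f} +/- {results['std_reward']:.2f}"

# Standard error of the mean, assuming independent episodes.
sem = results["std_reward"] / math.sqrt(results["n_eval_episodes"])

print(summary)
print(round(sem, 2))
```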
walker2d-v5-SAC-medium.zip ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:57e909eed1b99e22e520f032bc6a69c096957059a822709b9af5b47ebc7afc7c
+ size 3242585
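The added `.zip` and `.pth` files are stored as Git LFS pointers: three `key value` lines giving the spec version, the content hash, and the size in bytes. A small sketch of reading such a pointer is shown below; `parse_lfs_pointer` is a hypothetical helper written for illustration, not part of any Git LFS tooling.

```python
def parse_lfs_pointer(text):
    """Parse the key/value lines of a Git LFS pointer file into a dict."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.partition(" ")
        fields[key] = value
    # The oid has the form "<hash algorithm>:<hex digest>".
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "hash_algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


# Pointer contents of walker2d-v5-SAC-medium.zip from this commit.
pointer_text = """version https://git-lfs.github.com/spec/v1
oid sha256:57e909eed1b99e22e520f032bc6a69c096957059a822709b9af5b47ebc7afc7c
size 3242585
"""
pointer = parse_lfs_pointer(pointer_text)
print(pointer["hash_algo"], pointer["size"])
```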
walker2d-v5-SAC-medium/_stable_baselines3_version ADDED
@@ -0,0 +1 @@
+ 2.4.1
walker2d-v5-SAC-medium/actor.optimizer.pth ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:01ec929a286f5ee250cbc575b4a0e2f0679a07a3efa21f05a034570515009671
+ size 594766
walker2d-v5-SAC-medium/critic.optimizer.pth ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:b786eacd04c6b60710584659b8c39d155f50646b8a33d128ca132b0aa3786228
+ size 1165226
walker2d-v5-SAC-medium/data ADDED
@@ -0,0 +1,125 @@
+ {
+ "policy_class": {
+ ":type:": "<class 'abc.ABCMeta'>",
+ ":serialized:": "gAWVMAAAAAAAAACMHnN0YWJsZV9iYXNlbGluZXMzLnNhYy5wb2xpY2llc5SMCVNBQ1BvbGljeZSTlC4=",
+ "__module__": "stable_baselines3.sac.policies",
+ "__annotations__": "{'actor': <class 'stable_baselines3.sac.policies.Actor'>, 'critic': <class 'stable_baselines3.common.policies.ContinuousCritic'>, 'critic_target': <class 'stable_baselines3.common.policies.ContinuousCritic'>}",
+ "__doc__": "\n Policy class (with both actor and critic) for SAC.\n\n :param observation_space: Observation space\n :param action_space: Action space\n :param lr_schedule: Learning rate schedule (could be constant)\n :param net_arch: The specification of the policy and value networks.\n :param activation_fn: Activation function\n :param use_sde: Whether to use State Dependent Exploration or not\n :param log_std_init: Initial value for the log standard deviation\n :param use_expln: Use ``expln()`` function instead of ``exp()`` when using gSDE to ensure\n a positive standard deviation (cf paper). It allows to keep variance\n above zero and prevent it from growing too fast. In practice, ``exp()`` is usually enough.\n :param clip_mean: Clip the mean output when using gSDE to avoid numerical instability.\n :param features_extractor_class: Features extractor to use.\n :param features_extractor_kwargs: Keyword arguments\n to pass to the features extractor.\n :param normalize_images: Whether to normalize images or not,\n dividing by 255.0 (True by default)\n :param optimizer_class: The optimizer to use,\n ``th.optim.Adam`` by default\n :param optimizer_kwargs: Additional keyword arguments,\n excluding the learning rate, to pass to the optimizer\n :param n_critics: Number of critic networks to create.\n :param share_features_extractor: Whether to share or not the features extractor\n between the actor and the critic (this saves computation time)\n ",
+ "__init__": "<function SACPolicy.__init__ at 0x7f7385c0ce00>",
+ "_build": "<function SACPolicy._build at 0x7f7385c0d3a0>",
+ "_get_constructor_parameters": "<function SACPolicy._get_constructor_parameters at 0x7f7385c0d440>",
+ "reset_noise": "<function SACPolicy.reset_noise at 0x7f7385c0d4e0>",
+ "make_actor": "<function SACPolicy.make_actor at 0x7f7385c0d580>",
+ "make_critic": "<function SACPolicy.make_critic at 0x7f7385c0d620>",
+ "forward": "<function SACPolicy.forward at 0x7f7385c0d6c0>",
+ "_predict": "<function SACPolicy._predict at 0x7f7385c0d760>",
+ "set_training_mode": "<function SACPolicy.set_training_mode at 0x7f7385c0d800>",
+ "__abstractmethods__": "frozenset()",
+ "_abc_impl": "<_abc._abc_data object at 0x7f7385c14980>"
+ },
+ "verbose": 0,
+ "policy_kwargs": {
+ "use_sde": false
+ },
+ "num_timesteps": 5700000,
+ "_total_timesteps": 6000000,
+ "_num_timesteps_at_start": 0,
+ "seed": 0,
+ "action_noise": null,
+ "start_time": 1736092312186653182,
+ "learning_rate": 0.0003,
+ "tensorboard_log": "runs/0",
+ "_last_obs": {
+ ":type:": "<class 'numpy.ndarray'>",
+ ":serialized:": "gAWVHQMAAAAAAACMEm51bXB5LmNvcmUubnVtZXJpY5SMC19mcm9tYnVmZmVylJOUKJaoAgAAAAAAANLMTdozCfI/satoOoJG5L/SVamXUwHuvw3X8FCIFI0/uFYtKR0f6j/qxHl1Ej+SPxffvjhb0/S/oq/Hsb0P6j+kN+0KVu3uPx3d0qRbkcS/hCy50gZDxr8Cr4lFcu7zv1xTbqcGsq6/yLETqkOeyr+QSI159FPSv/5/TrSfKfa/HpksfFEsez8zqr8Gg1fzP47RUznpF4o/ZW0w9ot/27/0SdYlPxqaP/GDfy5/XNG/izaSumEjlT9mLiE7Vcryv0PIOjvtwOK/qr6tBSnSE0Dhro1KoQDcP6kniINh6QTAc77EtxfTHcBmcuqvraLNv9oFTcdpYQjA3hVuttWI6b+c8jiXLgILwEAgON4zfCNAV14V5QSc8z+WfqFUEbHJP44C0FR5Fc+/9Z36I49biD+jFn9v2RvrPyXNwXoAS5s/PTz/PC/I5b+jfj+ZU6jsvwQQn4gJUxNA1RLG9tKD8T8nDOnH0CcDQN1GKoXK1B5AgfPEKSiFxD8SLjex1Wzhv+MH871Se96/6KRKVT4XG0AAAAAAAAAkwCVUkGvtC/M/mo3RJb/+wb+/anTh22ajP16fYNuiSY4/WDLHa/nr6T/rywjqcx/Mv52s0Sqjk++/FxSGRqe+6r8PnSR7B3wZQKDMjJDxON2/9DxhwX1U/T9E48LvkSaQv83dji1F1Oy/THsx9wCc6L+J4OOwlwURQBftjJPRkh7AwzcJ+GzgIUDsa1TG5hj1P2yCF2SWz9E/OTJtNRRn07/oZ1g5lM+iP0RmduFqvuo/Th8iPGQ6kz+lfHA11l7nv5BR6rPUauq/Fqgiit96FEAjbfyHei/BP9KZ0pCOjQFARekgkcxlHEDOCvHlKBXbv8jVbmLK89+/diF+z0ferL+ClB0KYxsFwJzz5tNVr9k/lIwFbnVtcHmUjAVkdHlwZZSTlIwCZjiUiYiHlFKUKEsDjAE8lE5OTkr/////Sv////9LAHSUYksFSxGGlIwBQ5R0lFKULg=="
+ },
+ "_last_episode_starts": {
+ ":type:": "<class 'numpy.ndarray'>",
+ ":serialized:": "gAWVeAAAAAAAAACMEm51bXB5LmNvcmUubnVtZXJpY5SMC19mcm9tYnVmZmVylJOUKJYFAAAAAAAAAAEBAQEBlIwFbnVtcHmUjAVkdHlwZZSTlIwCYjGUiYiHlFKUKEsDjAF8lE5OTkr/////Sv////9LAHSUYksFhZSMAUOUdJRSlC4="
+ },
+ "_last_original_obs": {
+ ":type:": "<class 'numpy.ndarray'>",
+ ":serialized:": "gAWVHQMAAAAAAACMEm51bXB5LmNvcmUubnVtZXJpY5SMC19mcm9tYnVmZmVylJOUKJaoAgAAAAAAAM0RQnkXC/I/W8cSOHiC5L/w1FXvMfvtv9iLRBA/mIw/ZmRWFFso6j/KxEucHRuBP3PA6yL0kPS//MzM+MkO6j85gaR45V7yP8xzKj4NCqc/jYxARPoWAEB6xBviDM/wPwAiIzh5bsI/CJI4cJxvqr8N0/j6fVcFQDODz8TMdgXAbEfLFqlJmT9+ySNPbErzP0i4Uvid3qE/NOho1KWB17/t+8o3f+ibP200U0avG9C/ninVW3XdnT9MMIq5ilfyv0CJ39IcSOW/8rEN5zLjE0DJzLMqLSDXP2n8y7JjswfAp66z261TIMBdjFyChQXJv/GwF/+9agDA0PYB0eQZ9r91CHdjZe4MwAAAAAAAACRAzf3n4nhz8z/GcLYB00XHP3xlgpgZhdO/VONUmT7BhT9okJg/2yDrP1/ChMurPp8/hO1aV9mv57/9ywol1yzmv7I4HD0djxNAGLQrBi6e9D9EwC/KwtEDQKwPqFaFyh9AhcfUM3XQxD8swwrWMcThPzFTfkbeC+C/dDhe9SGOHEAAAAAAAAAkwLswlF0vHfM/i+jBh0Grw78rQW2988KjP46rDgckuZA/BRoQxZb26T9x3HuutjzQv3V3lpWxg+2/9aAnZ/OR7L+p4t890DMZQGQhHOFW1t+/tNV5CcAy9z/pNM9/s+7Hv6rdyaWair2/P5GJCDm5xb+DA19TjN0QQHyCfpgDZSDA2ewFH7jjE0Cog3yH6RH1P0z0Tu4we9A/aiJcCHJF1787VK+XCXukP+Z8/l4j4uo//tn28Ddkkz8f17ngluPmv4mXB+lSguq/lSLU3YhOFUAcAmfetdDSP/4D2hIbGwhA7yZVL+QKIEB9+XNGWAXYvzOLJWe+0eK/3NzWyyfEmz8yIeagndrxvz04s/mjl9I/lIwFbnVtcHmUjAVkdHlwZZSTlIwCZjiUiYiHlFKUKEsDjAE8lE5OTkr/////Sv////9LAHSUYksFSxGGlIwBQ5R0lFKULg=="
+ },
+ "_episode_num": 8819,
+ "use_sde": false,
+ "sde_sample_freq": -1,
+ "_current_progress_remaining": 0.05000083333333338,
+ "_stats_window_size": 100,
+ "ep_info_buffer": {
+ ":type:": "<class 'collections.deque'>",
+ ":serialized:": "gAWVRAwAAAAAAACMC2NvbGxlY3Rpb25zlIwFZGVxdWWUk5QpS2SGlFKUKH2UKIwBcpRHQLapVuB+WnmMAWyUTegDjAF0lEdA5FU+5KWcBnV9lChoBkdAtxCUG/vfCWgHTegDaAhHQORVSDBTGYN1fZQoaAZHQLbZBz0HyEtoB03oA2gIR0DkVVA83dbgdX2UKGgGR0C2bQzKs+3ZaAdN6ANoCEdA5FnNIOQQtnV9lChoBkdAtyOWf9P1tmgHTegDaAhHQORZ1M5sCT51fZQoaAZHQLbMiDAaef9oB03oA2gIR0DkWlZ8lXzUdX2UKGgGR0C3Z5DTz/ZNaAdN6ANoCEdA5FpewCbMHXV9lChoBkdAt1hkDp1RtWgHTegDaAhHQORaZfck+ot1fZQoaAZHQLdRio5ggHNoB03oA2gIR0DkXx63IdU9dX2UKGgGR0C3QrEUO/cnaAdN6ANoCEdA5F8lAYYR/XV9lChoBkdAtwAMFGG21GgHTegDaAhHQORfo96HCXR1fZQoaAZHQLbWaMK1G9ZoB03oA2gIR0DkX6vt2LYPdX2UKGgGR0C3cHFc+qzaaAdN6ANoCEdA5F+zj6WPcXV9lChoBkdAtye0wudwvWgHTegDaAhHQORkL6SmqHZ1fZQoaAZHQLaRG/9pAUtoB03oA2gIR0DkZDYZFXq8dX2UKGgGR0C21YAfuCwsaAdN6ANoCEdA5GSz7EHdGnV9lChoBkdAty5QCeVcEGgHTegDaAhHQORkvD5TIeZ1fZQoaAZHQLc3/YcNpdtoB03oA2gIR0DkZMOH8jzJdX2UKGgGR0C3IK8TnJT3aAdN6ANoCEdA5GkJ8xKxs3V9lChoBkdAtwjjp5eJHmgHTegDaAhHQORpEMasIVx1fZQoaAZHQLZ8YRkVerxoB03oA2gIR0DkaZJI4lyBdX2UKGgGR0C3OVNW6shgaAdN6ANoCEdA5GmbF6RhdHV9lChoBkdAtxNG8+Roy2gHTegDaAhHQORpos0WM0h1fZQoaAZHQLegZS9/SYxoB03oA2gIR0DkbfiKzAvddX2UKGgGR0C3Gpp1ie/YaAdN6ANoCEdA5G3+fffoBHV9lChoBkdAtuP5Q/HHWGgHTegDaAhHQORudav5gw51fZQoaAZHQLazgRwZOzpoB03oA2gIR0Dkbn7fZ26kdX2UKGgGR0C3rSQc5sCUaAdN6ANoCEdA5G6GIqbz9XV9lChoBkdAtz1F29tdiWgHTegDaAhHQORzD4CU5dZ1fZQoaAZHQLcw0QYk3S9oB03oA2gIR0DkcxbfVI7OdX2UKGgGR0C3J6r/GVAzaAdN6ANoCEdA5HOk7iZOSHV9lChoBkdAtv4yckMTe2gHTegDaAhHQORzrY6jnFJ1fZQoaAZHQLanZ3/givBoB03oA2gIR0Dkc7XvAoG6dX2UKGgGR0C2qESr5qM4aAdN6ANoCEdA5HftjTa0yHV9lChoBkdAtysMjqv/zmgHTegDaAhHQOR39AL1EmZ1fZQoaAZHQLcFRcpLEk1oB03oA2gIR0DkeHLgogFHdX2UKGgGR0C1sxR/3FkyaAdN6ANoCEdA5Hh8Qu/UOXV9lChoBkdAtwO/HcUM5WgHTegDaAhHQOR4g8hvBJt1fZQoaAZHQJVBm7EpAlhoB00sAWgIR0Dke7R4X40udX2UKGgGR0C2+wAGfPHDaAdN6ANoCEdA5HyKxHoX9HV9lChoBkdAtuKi+UQkHGgHTegDaAhHQOR8kS9RJmN1fZQoaAZHQLa9qCf6Gg1oB03oA2gIR0DkfRTUF0PpdX2UKGgGR0C2fi2V7hNuaAdN6ANoCEdA5H0cr0aqCHV9lChoBkdAoa9DpmmLtWgHTckBaAhHQOR9f5eXzDp1fZQoaAZHQLbSaocaOxVoB03oA2gIR0Dkf9iOe8PGdX2UKGgGR0C21WAT7EYPaAdN6ANoCEdA5ICqNWU8m3V9lChoBkdAt2AOt
yPuHGgHTegDaAhHQOSBN1yvLYB1fZQoaAZHQLaoV3M6ikBoB03oA2gIR0DkgUBoFmnPdX2UKGgGR0C22pJ0KZ2IaAdN6ANoCEdA5IGvf8VHnXV9lChoBkdAtzTYAWBSUGgHTegDaAhHQOSFJEi8nNR1fZQoaAZHQLePsR1X/5toB03oA2gIR0DkhgXar3j/dX2UKGgGR0C3fRUr9VFQaAdN6ANoCEdA5IaQj2i+L3V9lChoBkdAt38vHp8neGgHTegDaAhHQOSGmidxyXF1fZQoaAZHQLdZyzMibDxoB03oA2gIR0DkhwGeT3ZgdX2UKGgGR0C3IvCO3lS1aAdN6ANoCEdA5Ipvae5Fw3V9lChoBkdAtvlbDtPYWmgHTegDaAhHQOSLT2OIZZV1fZQoaAZHQLc0P1c+qzZoB03oA2gIR0Dki+CD7IkrdX2UKGgGR0C25IzhHbypaAdN6ANoCEdA5Ivp/SQYDXV9lChoBkdAt1nQw9JSSGgHTegDaAhHQOSMWmjM3ZR1fZQoaAZHQLeVZIsRQJpoB03oA2gIR0Dkj5TILgGbdX2UKGgGR0C2+4nY+Sr6aAdN6ANoCEdA5JCF5Nwiq3V9lChoBkdAtwWTyxzJZGgHTegDaAhHQOSRFP1anrJ1fZQoaAZHQLcnUcvduYRoB03oA2gIR0DkkR0QaJhwdX2UKGgGR0C3NmwT/Q0GaAdN6ANoCEdA5JGDCcwxnHV9lChoBkdAt4vcNCqp+GgHTegDaAhHQOSU9SvX9R91fZQoaAZHQLc9HGJemeloB03oA2gIR0Dklb1oC+10dX2UKGgGR0C3smYgieNDaAdN6ANoCEdA5JZAXcpLEnV9lChoBkdAt466/zreImgHTegDaAhHQOSWSS+i8Fp1fZQoaAZHQLakeB4D9wZoB03oA2gIR0Dklqa8lolEdX2UKGgGR0C24/HOnl4kaAdN6ANoCEdA5JpJOz6acHV9lChoBkdAt1bLeXRgJGgHTegDaAhHQOSbIIMKCxx1fZQoaAZHQLIEeee4Cp5oB00RA2gIR0DkmzXdfLLZdX2UKGgGR0C3S8YAfdRBaAdN6ANoCEdA5JumjaXa8HV9lChoBkdAt4k0vnKW9mgHTegDaAhHQOScBY3eenR1fZQoaAZHQLcEeCQLeANoB03oA2gIR0Dkn56QCjk/dX2UKGgGR0C2+/RJVbRnaAdN6ANoCEdA5KCIE/B3zXV9lChoBkdAtr6G+ajN6mgHTegDaAhHQOSgoU4//vR1fZQoaAZHQLadBZIxxkxoB03oA2gIR0DkoSPCN0eVdX2UKGgGR0CvLKpzcRDkaAdNzwJoCEdA5KE1cw5/9nV9lChoBkdAtwFQKw6hg2gHTegDaAhHQOShi32RJVd1fZQoaAZHQLd4dBHkLhJoB03oA2gIR0DkpdrLIxQBdX2UKGgGR0C3HhLc0tROaAdN6ANoCEdA5KXyJ40Mw3V9lChoBkdAtxlvXYlIE2gHTegDaAhHQOSmcUTYdyV1fZQoaAZHQLZKWlt0mt1oB03oA2gIR0DkpoQDLbHqdX2UKGgGR0C2oxY9xIataAdN6ANoCEdA5KbdcsDnvHV9lChoBkdAtuhtYoy9EmgHTegDaAhHQOSrchlpXZJ1fZQoaAZHQLcjEUuctoVoB03oA2gIR0Dkq4g2oegddX2UKGgGR0C2PEjsUqQSaAdN6ANoCEdA5Kv5j0+TvHV9lChoBkdAtp+g+KTB7GgHTegDaAhHQOSsCk0tRN11fZQoaAZHQLfifZjQRf5oB03oA2gIR0DkrF8sDnvEdX2UKGgGR0C3eEs8gZCOaAdN6ANoCEdA5LCkq94/vHV9lChoBkdAt5RorDqGDmgHTegDaAhHQOSwvAzJp351fZQoaAZHQLcqOtY0VJtoB03oA2gIR0DksTDE2HcldX2UKGgGR0C3LToIrvsraAdN6ANoCEdA5LFDmQCCBnV9lChoBkdAt7DQx46fa2gHTegDaAhHQ
OSxnhc7heh1fZQoaAZHQLc2yCF9KEpoB03oA2gIR0DktdL/b0vodX2UKGgGR0C2iKWW6bvxaAdN6ANoCEdA5LXp/7SApnV9lChoBkdAt0US2a2F4GgHTegDaAhHQOS2ZoKpkwx1fZQoaAZHQLdIEOjZcs1oB03oA2gIR0Dktnq4S6DodX2UKGgGR0C2HOuJUHY6aAdN6ANoCEdA5LbTUMgEEHVlLg=="
+ },
+ "ep_success_buffer": {
+ ":type:": "<class 'collections.deque'>",
+ ":serialized:": "gAWVIAAAAAAAAACMC2NvbGxlY3Rpb25zlIwFZGVxdWWUk5QpS2SGlFKULg=="
+ },
+ "_n_updates": 1137999,
+ "buffer_size": 1000000,
+ "batch_size": 256,
+ "learning_starts": 10000,
+ "tau": 0.005,
+ "gamma": 0.99,
+ "gradient_steps": 1,
+ "optimize_memory_usage": false,
+ "replay_buffer_class": {
+ ":type:": "<class 'abc.ABCMeta'>",
+ ":serialized:": "gAWVNQAAAAAAAACMIHN0YWJsZV9iYXNlbGluZXMzLmNvbW1vbi5idWZmZXJzlIwMUmVwbGF5QnVmZmVylJOULg==",
+ "__module__": "stable_baselines3.common.buffers",
+ "__annotations__": "{'observations': <class 'numpy.ndarray'>, 'next_observations': <class 'numpy.ndarray'>, 'actions': <class 'numpy.ndarray'>, 'rewards': <class 'numpy.ndarray'>, 'dones': <class 'numpy.ndarray'>, 'timeouts': <class 'numpy.ndarray'>}",
+ "__doc__": "\n Replay buffer used in off-policy algorithms like SAC/TD3.\n\n :param buffer_size: Max number of element in the buffer\n :param observation_space: Observation space\n :param action_space: Action space\n :param device: PyTorch device\n :param n_envs: Number of parallel environments\n :param optimize_memory_usage: Enable a memory efficient variant\n of the replay buffer which reduces by almost a factor two the memory used,\n at a cost of more complexity.\n See https://github.com/DLR-RM/stable-baselines3/issues/37#issuecomment-637501195\n and https://github.com/DLR-RM/stable-baselines3/pull/28#issuecomment-637559274\n Cannot be used in combination with handle_timeout_termination.\n :param handle_timeout_termination: Handle timeout termination (due to timelimit)\n separately and treat the task as infinite horizon task.\n https://github.com/DLR-RM/stable-baselines3/issues/284\n ",
+ "__init__": "<function ReplayBuffer.__init__ at 0x7f7390ae4d60>",
+ "add": "<function ReplayBuffer.add at 0x7f7390ae4ea0>",
+ "sample": "<function ReplayBuffer.sample at 0x7f7390ae4f40>",
+ "_get_samples": "<function ReplayBuffer._get_samples at 0x7f7390ae4fe0>",
+ "_maybe_cast_dtype": "<staticmethod(<function ReplayBuffer._maybe_cast_dtype at 0x7f7390ae5080>)>",
+ "__abstractmethods__": "frozenset()",
+ "_abc_impl": "<_abc._abc_data object at 0x7f7390ade540>"
+ },
+ "replay_buffer_kwargs": {},
+ "train_freq": {
+ ":type:": "<class 'stable_baselines3.common.type_aliases.TrainFreq'>",
+ ":serialized:": "gAWVYQAAAAAAAACMJXN0YWJsZV9iYXNlbGluZXMzLmNvbW1vbi50eXBlX2FsaWFzZXOUjAlUcmFpbkZyZXGUk5RLAWgAjBJUcmFpbkZyZXF1ZW5jeVVuaXSUk5SMBHN0ZXCUhZRSlIaUgZQu"
+ },
+ "use_sde_at_warmup": false,
+ "target_entropy": -6.0,
+ "ent_coef": "auto",
+ "target_update_interval": 1,
+ "observation_space": {
+ ":type:": "<class 'gymnasium.spaces.box.Box'>",
+ ":serialized:": "gAWVsQIAAAAAAACMFGd5bW5hc2l1bS5zcGFjZXMuYm94lIwDQm94lJOUKYGUfZQojAVkdHlwZZSMBW51bXB5lIwFZHR5cGWUk5SMAmY4lImIh5RSlChLA4wBPJROTk5K/////0r/////SwB0lGKMBl9zaGFwZZRLEYWUjANsb3eUjBJudW1weS5jb3JlLm51bWVyaWOUjAtfZnJvbWJ1ZmZlcpSTlCiWiAAAAAAAAAAAAAAAAADw/wAAAAAAAPD/AAAAAAAA8P8AAAAAAADw/wAAAAAAAPD/AAAAAAAA8P8AAAAAAADw/wAAAAAAAPD/AAAAAAAA8P8AAAAAAADw/wAAAAAAAPD/AAAAAAAA8P8AAAAAAADw/wAAAAAAAPD/AAAAAAAA8P8AAAAAAADw/wAAAAAAAPD/lGgLSxGFlIwBQ5R0lFKUjA1ib3VuZGVkX2JlbG93lGgTKJYRAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAlGgIjAJiMZSJiIeUUpQoSwOMAXyUTk5OSv////9K/////0sAdJRiSxGFlGgWdJRSlIwEaGlnaJRoEyiWiAAAAAAAAAAAAAAAAADwfwAAAAAAAPB/AAAAAAAA8H8AAAAAAADwfwAAAAAAAPB/AAAAAAAA8H8AAAAAAADwfwAAAAAAAPB/AAAAAAAA8H8AAAAAAADwfwAAAAAAAPB/AAAAAAAA8H8AAAAAAADwfwAAAAAAAPB/AAAAAAAA8H8AAAAAAADwfwAAAAAAAPB/lGgLSxGFlGgWdJRSlIwNYm91bmRlZF9hYm92ZZRoEyiWEQAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAJRoHUsRhZRoFnSUUpSMCGxvd19yZXBylIwELWluZpSMCWhpZ2hfcmVwcpSMA2luZpSMCl9ucF9yYW5kb22UTnViLg==",
+ "dtype": "float64",
+ "_shape": [
+ 17
+ ],
+ "low": "[-inf -inf -inf -inf -inf -inf -inf -inf -inf -inf -inf -inf -inf -inf\n -inf -inf -inf]",
+ "bounded_below": "[False False False False False False False False False False False False\n False False False False False]",
+ "high": "[inf inf inf inf inf inf inf inf inf inf inf inf inf inf inf inf inf]",
+ "bounded_above": "[False False False False False False False False False False False False\n False False False False False]",
+ "low_repr": "-inf",
+ "high_repr": "inf",
+ "_np_random": null
+ },
+ "action_space": {
+ ":type:": "<class 'gymnasium.spaces.box.Box'>",
+ ":serialized:": "gAWVfgIAAAAAAACMFGd5bW5hc2l1bS5zcGFjZXMuYm94lIwDQm94lJOUKYGUfZQojAVkdHlwZZSMBW51bXB5lIwFZHR5cGWUk5SMAmY0lImIh5RSlChLA4wBPJROTk5K/////0r/////SwB0lGKMBl9zaGFwZZRLBoWUjANsb3eUjBJudW1weS5jb3JlLm51bWVyaWOUjAtfZnJvbWJ1ZmZlcpSTlCiWGAAAAAAAAAAAAIC/AACAvwAAgL8AAIC/AACAvwAAgL+UaAtLBoWUjAFDlHSUUpSMDWJvdW5kZWRfYmVsb3eUaBMolgYAAAAAAAAAAQEBAQEBlGgIjAJiMZSJiIeUUpQoSwOMAXyUTk5OSv////9K/////0sAdJRiSwaFlGgWdJRSlIwEaGlnaJRoEyiWGAAAAAAAAAAAAIA/AACAPwAAgD8AAIA/AACAPwAAgD+UaAtLBoWUaBZ0lFKUjA1ib3VuZGVkX2Fib3ZllGgTKJYGAAAAAAAAAAEBAQEBAZRoHUsGhZRoFnSUUpSMCGxvd19yZXBylIwELTEuMJSMCWhpZ2hfcmVwcpSMAzEuMJSMCl9ucF9yYW5kb22UjBRudW1weS5yYW5kb20uX3BpY2tsZZSMEF9fZ2VuZXJhdG9yX2N0b3KUk5SMBVBDRzY0lGgyjBRfX2JpdF9nZW5lcmF0b3JfY3RvcpSTlIaUUpR9lCiMDWJpdF9nZW5lcmF0b3KUjAVQQ0c2NJSMBXN0YXRllH2UKGg9ihDjYZWmt15YCS1Fllk0taEajANpbmOUihCpc3hEvDOBWIIa9zrb2o1BdYwKaGFzX3VpbnQzMpRLAIwIdWludGVnZXKUSwB1YnViLg==",
+ "dtype": "float32",
+ "_shape": [
+ 6
+ ],
+ "low": "[-1. -1. -1. -1. -1. -1.]",
+ "bounded_below": "[ True True True True True True]",
+ "high": "[1. 1. 1. 1. 1. 1.]",
+ "bounded_above": "[ True True True True True True]",
+ "low_repr": "-1.0",
+ "high_repr": "1.0",
+ "_np_random": "Generator(PCG64)"
+ },
+ "n_envs": 5,
+ "lr_schedule": {
+ ":type:": "<class 'function'>",
+ ":serialized:": "gAWVRgQAAAAAAACMF2Nsb3VkcGlja2xlLmNsb3VkcGlja2xllIwOX21ha2VfZnVuY3Rpb26Uk5QoaACMDV9idWlsdGluX3R5cGWUk5SMCENvZGVUeXBllIWUUpQoSwFLAEsASwFLBUsTQyaVAZcAdAEAAAAAAAAAAAIAiQF8AKsBAAAAAAAAqwEAAAAAAABTAJROhZSMBWZsb2F0lIWUjBJwcm9ncmVzc19yZW1haW5pbmeUhZSMaC9ob21lL21hc3Rlci1hbmRyZWFzL2dlbl9kYXRhc2V0L3Rlc3RfZW52L2xpYi9weXRob24zLjEyL3NpdGUtcGFja2FnZXMvc3RhYmxlX2Jhc2VsaW5lczMvY29tbW9uL3V0aWxzLnB5lIwIPGxhbWJkYT6UjCFnZXRfc2NoZWR1bGVfZm4uPGxvY2Fscz4uPGxhbWJkYT6US2FDEPiAAKRlqU7QO03TLE7UJk+UQwCUjA52YWx1ZV9zY2hlZHVsZZSFlCl0lFKUfZQojAtfX3BhY2thZ2VfX5SMGHN0YWJsZV9iYXNlbGluZXMzLmNvbW1vbpSMCF9fbmFtZV9flIwec3RhYmxlX2Jhc2VsaW5lczMuY29tbW9uLnV0aWxzlIwIX19maWxlX1+UjGgvaG9tZS9tYXN0ZXItYW5kcmVhcy9nZW5fZGF0YXNldC90ZXN0X2Vudi9saWIvcHl0aG9uMy4xMi9zaXRlLXBhY2thZ2VzL3N0YWJsZV9iYXNlbGluZXMzL2NvbW1vbi91dGlscy5weZR1Tk5oAIwQX21ha2VfZW1wdHlfY2VsbJSTlClSlIWUdJRSlGgAjBJfZnVuY3Rpb25fc2V0c3RhdGWUk5RoI32UfZQoaBqMCDxsYW1iZGE+lIwMX19xdWFsbmFtZV9flIwhZ2V0X3NjaGVkdWxlX2ZuLjxsb2NhbHM+LjxsYW1iZGE+lIwPX19hbm5vdGF0aW9uc19flH2UjA5fX2t3ZGVmYXVsdHNfX5ROjAxfX2RlZmF1bHRzX1+UTowKX19tb2R1bGVfX5RoG4wHX19kb2NfX5ROjAtfX2Nsb3N1cmVfX5RoAIwKX21ha2VfY2VsbJSTlGgCKGgHKEsBSwBLAEsBSwFLE0MIlQGXAIkBUwCUaAkpjAFflIWUaA6MBGZ1bmOUjBljb25zdGFudF9mbi48bG9jYWxzPi5mdW5jlEuFQwj4gADYDxKICpRoEowDdmFslIWUKXSUUpRoF05OaB8pUpSFlHSUUpRoJWhBfZR9lChoGowEZnVuY5RoKYwZY29uc3RhbnRfZm4uPGxvY2Fscz4uZnVuY5RoK32UaC1OaC5OaC9oG2gwTmgxaDNHPzOpKjBVMmGFlFKUhZSMF19jbG91ZHBpY2tsZV9zdWJtb2R1bGVzlF2UjAtfX2dsb2JhbHNfX5R9lHWGlIZSMIWUUpSFlGhKXZRoTH2UdYaUhlIwLg=="
+ },
+ "batch_norm_stats": [],
+ "batch_norm_stats_target": []
+ }
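The `:serialized:` fields in this data file are base64-encoded pickles. Their contents can be inspected without unpickling (and therefore without executing anything or needing Stable-Baselines3 installed) by walking the opcode stream with the standard library's `pickletools`. The sketch below is one way to do that; `pickled_strings` is a hypothetical helper name, and it only lists the string literals embedded in the payload.

```python
import base64
import pickletools


def pickled_strings(b64_payload):
    """List the string literals in a base64-encoded pickle without loading it."""
    raw = base64.b64decode(b64_payload)
    return [
        arg
        for _opcode, arg, _pos in pickletools.genops(raw)
        if isinstance(arg, str)
    ]


# ":serialized:" value of "policy_class" from the data file in this commit.
policy_class_b64 = (
    "gAWVMAAAAAAAAACMHnN0YWJsZV9iYXNlbGluZXMzLnNhYy5wb2xpY2llc5SM"
    "CVNBQ1BvbGljeZSTlC4="
)
names = pickled_strings(policy_class_b64)
print(names)
```

Here the two strings are the module path and class name that the pickle would import, which matches the `__module__` entry recorded next to it.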
walker2d-v5-SAC-medium/ent_coef_optimizer.pth ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:57498edc7ba53d4167e8a6cf3fea12dc0b994fa29e982ef479d2353f022dad1e
+ size 1940
walker2d-v5-SAC-medium/policy.pth ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:698f4e2b8caaf4570da21c28c24b55ef86cc76e2cace3ffd2d2d5e1c72627c84
+ size 1460406
walker2d-v5-SAC-medium/pytorch_variables.pth ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:4b0a1c2c2a20737c449cb7809086a91b1dd11e06110fdba98d6ea4efaff744a5
+ size 1180
walker2d-v5-SAC-medium/system_info.txt ADDED
@@ -0,0 +1,8 @@
+ - OS: Linux-6.1.125-1-MANJARO-x86_64-with-glibc2.40 # 1 SMP PREEMPT_DYNAMIC Fri Jan 17 15:04:03 UTC 2025
+ - Python: 3.12.8
+ - Stable-Baselines3: 2.4.1
+ - PyTorch: 2.5.1+cu124
+ - GPU Enabled: True
+ - Numpy: 1.26.4
+ - Cloudpickle: 3.1.0
+ - Gymnasium: 1.0.0