lambdavi/ppo-SnowballTarget

Reinforcement Learning

deep-reinforcement-learning

ML-Agents-SnowballTarget


lambdavi committed on Jan 15, 2024

Commit 7e7dde2 · verified · 1 Parent(s): 5343cfc

Update README.md

Files changed (1)

README.md +46 -1


@@ -32,4 +32,49 @@ tags:
   2. Step 1: Find your model_id: lambdavi/ppo-SnowballTarget
   3. Step 2: Select your *.nn /*.onnx file
   4. Click on Watch the agent play 👀

+  ### Hyperparams used:
+  SnowballTarget:
+  	trainer_type:	ppo
+  	hyperparameters:
+  	  batch_size:	128
+  	  buffer_size:	2048
+  	  learning_rate:	0.005
+  	  beta:	0.005
+  	  epsilon:	0.2
+  	  lambd:	0.95
+  	  num_epoch:	5
+  	  shared_critic:	False
+  	  learning_rate_schedule:	linear
+  	  beta_schedule:	linear
+  	  epsilon_schedule:	linear
+  	checkpoint_interval:	50000
+  	network_settings:
+  	  normalize:	False
+  	  hidden_units:	256
+  	  num_layers:	2
+  	  vis_encode_type:	simple
+  	  memory:	None
+  	  goal_conditioning_type:	hyper
+  	  deterministic:	False
+  	reward_signals:
+  	  extrinsic:
+  	    gamma:	0.99
+  	    strength:	1.0
+  	    network_settings:
+  	      normalize:	False
+  	      hidden_units:	128
+  	      num_layers:	2
+  	      vis_encode_type:	simple
+  	      memory:	None
+  	      goal_conditioning_type:	hyper
+  	      deterministic:	False
+  	init_path:	None
+  	keep_checkpoints:	10
+  	even_checkpoints:	False
+  	max_steps:	500000
+  	time_horizon:	64
+  	summary_freq:	10000
+  	threaded:	True
+  	self_play:	None
+  	behavioral_cloning:	None
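
As a rough sanity check on the values above, the following Python sketch derives a few quantities from them. It is only an illustration: the `config` dictionary and the `derived_quantities` helper are hypothetical names, not part of ML-Agents, and the calculation assumes one PPO update each time `buffer_size` experiences have been collected, so the real counts in a training run may differ slightly.

```python
# Values copied from the hyperparameter listing above.
config = {
    "batch_size": 128,
    "buffer_size": 2048,
    "num_epoch": 5,
    "max_steps": 500_000,
    "checkpoint_interval": 50_000,
    "keep_checkpoints": 10,
    "summary_freq": 10_000,
}


def derived_quantities(cfg):
    """Derive approximate training counts (hypothetical helper, not an ML-Agents API)."""
    # Each pass over the buffer is split into minibatches of batch_size.
    minibatches_per_epoch = cfg["buffer_size"] // cfg["batch_size"]
    return {
        "minibatches_per_epoch": minibatches_per_epoch,
        # num_epoch passes over the buffer per policy update.
        "gradient_steps_per_update": minibatches_per_epoch * cfg["num_epoch"],
        # Assumption: one policy update per full buffer of experiences.
        "approx_policy_updates": cfg["max_steps"] // cfg["buffer_size"],
        "checkpoints_written": cfg["max_steps"] // cfg["checkpoint_interval"],
        "summaries_written": cfg["max_steps"] // cfg["summary_freq"],
    }


derived = derived_quantities(config)
for name, value in derived.items():
    print(f"{name}: {value}")

# With keep_checkpoints at least as large as the number of checkpoints
# written, no interval checkpoint should be discarded under this estimate.
print("all checkpoints kept:",
      derived["checkpoints_written"] <= config["keep_checkpoints"])
```

Under these assumptions, a `checkpoint_interval` of 50000 over 500000 steps gives about as many checkpoints as `keep_checkpoints` allows, so the run would be expected to retain every interval checkpoint.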