loisonchambers
/

lunar2

Reinforcement Learning

deep-reinforcement-learning

custom-implementation

Eval Results (legacy)

Model card Files Files and versions

Metrics Training metrics Community

lunar2 / README.md

loisonchambers's picture

Upload folder using huggingface_hub

6f278bf verified about 1 month ago

|

history blame contribute delete

680 Bytes

	---
	tags:
	- ppo
	- LunarLander-v2
	- deep-reinforcement-learning
	- reinforcement-learning
	- custom-implementation
	- deep-rl-course
	model-index:
	- name: PPO-LunarLander-v2
	results:
	- task:
	type: reinforcement-learning
	name: reinforcement-learning
	dataset:
	name: LunarLander-v2
	type: LunarLander-v2
	metrics:
	- type: mean_reward
	value: 250.0 # Make sure this represents your actual performance
	name: mean_reward
	---

	# PPO Agent for LunarLander-v2

	This is a trained model of a PPO agent for LunarLander-v2.
	It was trained as part of the [Hugging Face Deep RL Course](https://huggingface.co/learn/deep-rl-course/unit8/introduction).