StevanLS
/

ddpg-panda-reach-10

Reinforcement Learning

stable-baselines3

PandaReachJointsDense-v3

deep-reinforcement-learning

Eval Results (legacy)

Model card Files Files and versions

ddpg-panda-reach-10 / README.md

StevanLS's picture

Update README.md

939ffde verified about 1 year ago

|

history blame contribute delete

2.46 kB

	---
	library_name: stable-baselines3
	tags:
	- PandaReachJointsDense-v3
	- deep-reinforcement-learning
	- reinforcement-learning
	- stable-baselines3
	- DDPG
	model-index:
	- name: DDPG Panda Reach
	results:
	- task:
	type: reinforcement-learning
	name: Robot Manipulation
	dataset:
	name: PandaReachJointsDense-v3
	type: panda-gym
	metrics:
	- type: mean_reward
	value: REPLACE_WITH_ACTUAL_MEAN_REWARD # Replace with your model's mean reward
	name: mean_reward

	---

	# DDPG Panda Reach Model

	This is a DDPG (Deep Deterministic Policy Gradient) model trained to control a Panda robotic arm in a reaching task. The model was trained using Stable-Baselines3.

	## Task Description

	The task involves controlling a 7-DOF Panda robotic arm to reach a target position in 3D space. The environment provides dense rewards based on the distance between the end-effector and the target position.

	## Training Details

	- Environment: PandaReachJointsDense-v3
	- Algorithm: DDPG with HER (Hindsight Experience Replay)
	- Training Steps: 10,000
	- Policy: MultiInputPolicy
	- Training Framework: Stable-Baselines3

	## Usage

	```python
	import gymnasium as gym
	import panda_gym
	from stable_baselines3 import DDPG

	# Create environment
	env = gym.make("PandaReachJointsDense-v3", render_mode="human")

	# Load the trained model
	model = DDPG.load("StevanLS/ddpg-panda-reach-10")

	# Test the model
	obs, _ = env.reset()
	while True:
	action, _ = model.predict(obs)
	obs, reward, done, truncated, info = env.step(action)
	if done or truncated:
	obs, _ = env.reset()
	```

	## Author

	- StevanLS

	## Citations

	```bibtex
	@article{gymatorium2023,
	author={Farama Foundation},
	title={Gymnasium},
	year={2023},
	journal={GitHub repository},
	publisher={GitHub},
	url={https://github.com/Farama-Foundation/Gymnasium}
	}

	@article{raffin2021stable,
	title={Stable-baselines3: Reliable reinforcement learning implementations},
	author={Raffin, Antonin and Hill, Ashley and Gleave, Adam and Kanervisto, Anssi and Ernestus, Maximilian and Dormann, Noah},
	journal={Journal of Machine Learning Research},
	year={2021}
	}

	@article{gallouedec2021pandagym,
	title={panda-gym: Open-Source Goal-Conditioned Environments for Robotic Learning},
	author={Gallou{\'e}dec, Quentin and Cazin, Nicolas and Dellandr{\'e}a, Emmanuel and Chen, Liming},
	journal={arXiv preprint arXiv:2106.13687},
	year={2021}
	}
	```