ddpg-panda-reach-10 / README.md
StevanLS's picture
Update README.md
939ffde verified
---
library_name: stable-baselines3
tags:
- PandaReachJointsDense-v3
- deep-reinforcement-learning
- reinforcement-learning
- stable-baselines3
- DDPG
model-index:
- name: DDPG Panda Reach
results:
- task:
type: reinforcement-learning
name: Robot Manipulation
dataset:
name: PandaReachJointsDense-v3
type: panda-gym
metrics:
- type: mean_reward
value: REPLACE_WITH_ACTUAL_MEAN_REWARD # Replace with your model's mean reward
name: mean_reward
---
# DDPG Panda Reach Model
This is a DDPG (Deep Deterministic Policy Gradient) model trained to control a Panda robotic arm in a reaching task. The model was trained using Stable-Baselines3.
## Task Description
The task involves controlling a 7-DOF Panda robotic arm to reach a target position in 3D space. The environment provides dense rewards based on the distance between the end-effector and the target position.
## Training Details
- Environment: PandaReachJointsDense-v3
- Algorithm: DDPG with HER (Hindsight Experience Replay)
- Training Steps: 10,000
- Policy: MultiInputPolicy
- Training Framework: Stable-Baselines3
## Usage
```python
import gymnasium as gym
import panda_gym
from stable_baselines3 import DDPG
# Create environment
env = gym.make("PandaReachJointsDense-v3", render_mode="human")
# Load the trained model
model = DDPG.load("StevanLS/ddpg-panda-reach-10")
# Test the model
obs, _ = env.reset()
while True:
action, _ = model.predict(obs)
obs, reward, done, truncated, info = env.step(action)
if done or truncated:
obs, _ = env.reset()
```
## Author
- StevanLS
## Citations
```bibtex
@article{gymatorium2023,
author={Farama Foundation},
title={Gymnasium},
year={2023},
journal={GitHub repository},
publisher={GitHub},
url={https://github.com/Farama-Foundation/Gymnasium}
}
@article{raffin2021stable,
title={Stable-baselines3: Reliable reinforcement learning implementations},
author={Raffin, Antonin and Hill, Ashley and Gleave, Adam and Kanervisto, Anssi and Ernestus, Maximilian and Dormann, Noah},
journal={Journal of Machine Learning Research},
year={2021}
}
@article{gallouedec2021pandagym,
title={panda-gym: Open-Source Goal-Conditioned Environments for Robotic Learning},
author={Gallou{\'e}dec, Quentin and Cazin, Nicolas and Dellandr{\'e}a, Emmanuel and Chen, Liming},
journal={arXiv preprint arXiv:2106.13687},
year={2021}
}
```