--- library_name: stable-baselines3 tags: - PandaReachJointsDense-v3 - deep-reinforcement-learning - reinforcement-learning - stable-baselines3 - DDPG model-index: - name: DDPG Panda Reach results: - task: type: reinforcement-learning name: Robot Manipulation dataset: name: PandaReachJointsDense-v3 type: panda-gym metrics: - type: mean_reward value: REPLACE_WITH_ACTUAL_MEAN_REWARD # Replace with your model's mean reward name: mean_reward --- # DDPG Panda Reach Model This is a DDPG (Deep Deterministic Policy Gradient) model trained to control a Panda robotic arm in a reaching task. The model was trained using Stable-Baselines3. ## Task Description The task involves controlling a 7-DOF Panda robotic arm to reach a target position in 3D space. The environment provides dense rewards based on the distance between the end-effector and the target position. ## Training Details - Environment: PandaReachJointsDense-v3 - Algorithm: DDPG with HER (Hindsight Experience Replay) - Training Steps: 10,000 - Policy: MultiInputPolicy - Training Framework: Stable-Baselines3 ## Usage ```python import gymnasium as gym import panda_gym from stable_baselines3 import DDPG # Create environment env = gym.make("PandaReachJointsDense-v3", render_mode="human") # Load the trained model model = DDPG.load("StevanLS/ddpg-panda-reach-10") # Test the model obs, _ = env.reset() while True: action, _ = model.predict(obs) obs, reward, done, truncated, info = env.step(action) if done or truncated: obs, _ = env.reset() ``` ## Author - StevanLS ## Citations ```bibtex @article{gymatorium2023, author={Farama Foundation}, title={Gymnasium}, year={2023}, journal={GitHub repository}, publisher={GitHub}, url={https://github.com/Farama-Foundation/Gymnasium} } @article{raffin2021stable, title={Stable-baselines3: Reliable reinforcement learning implementations}, author={Raffin, Antonin and Hill, Ashley and Gleave, Adam and Kanervisto, Anssi and Ernestus, Maximilian and Dormann, Noah}, journal={Journal of Machine Learning Research}, year={2021} } @article{gallouedec2021pandagym, title={panda-gym: Open-Source Goal-Conditioned Environments for Robotic Learning}, author={Gallou{\'e}dec, Quentin and Cazin, Nicolas and Dellandr{\'e}a, Emmanuel and Chen, Liming}, journal={arXiv preprint arXiv:2106.13687}, year={2021} } ```