A2C PandaReachJointsDense-v3
This repository contains a Stable Baselines3 implementation of the Advantage Actor-Critic (A2C) algorithm trained on the PandaReachJointsDense-v3 environment from panda-gym. The model was trained for 500,000 timesteps to learn how to reach points in 3D space by controlling the robot's articulations.
Video Preview
Direct link: https://huggingface.co/Louisdlms/a2c-PandaReachJointsDense-v3/resolve/main/videos/panda_reach_result.mp4
- Downloads last month
- 40