HAN Humanoid Balance PPO v1
Overview
This model is designed for humanoid robot balance control using Proximal Policy Optimization (PPO). It predicts joint torque adjustments to maintain stability during standing and walking.
Architecture
- Algorithm: Proximal Policy Optimization (PPO)
- Actor-Critic Network
- 3 Hidden Layers (256 units each)
- Activation: ReLU
Training Environment
- Physics-based humanoid simulation
- Variable terrain conditions
- Randomized push disturbances
Observation Space
- Joint positions
- Joint velocities
- IMU orientation
- Ground contact signals
Action Space
- Continuous joint torque outputs
Intended Use
- Humanoid robot balance research
- Reinforcement learning experiments
- Robotics simulation projects
Limitations
This model is trained in simulation and may require fine-tuning for real-world deployment.
Author
Caplin43
- Downloads last month
- 5