HAN Humanoid Balance PPO v1

Overview

This model is designed for humanoid robot balance control using Proximal Policy Optimization (PPO). It predicts joint torque adjustments to maintain stability during standing and walking.

Architecture

  • Algorithm: Proximal Policy Optimization (PPO)
  • Actor-Critic Network
  • 3 Hidden Layers (256 units each)
  • Activation: ReLU

Training Environment

  • Physics-based humanoid simulation
  • Variable terrain conditions
  • Randomized push disturbances

Observation Space

  • Joint positions
  • Joint velocities
  • IMU orientation
  • Ground contact signals

Action Space

  • Continuous joint torque outputs

Intended Use

  • Humanoid robot balance research
  • Reinforcement learning experiments
  • Robotics simulation projects

Limitations

This model is trained in simulation and may require fine-tuning for real-world deployment.

Author

Caplin43

Downloads last month
5
Video Preview
loading