Learns high-level decision policies for humanoid robots based on perceived states.
Base Model: distilbert-base-uncased