A BERT-based text classification model that maps natural-language robot commands to command labels.
Input: "Turn left"
Output: LEFT
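
The input/output pair above could be reproduced with something like the following sketch. It assumes the model is saved as a Hugging Face sequence-classification checkpoint, which the description does not confirm. MODEL_ID is a hypothetical placeholder (no checkpoint name is given), and the helper names top_label and classify are illustrative only.

```python
from typing import Mapping, Sequence

# Hypothetical placeholder: the source does not name the checkpoint.
MODEL_ID = "path/to/robot-command-classifier"


def top_label(scores: Sequence[float], id2label: Mapping[int, str]) -> str:
    """Return the label with the highest score (argmax over class scores)."""
    best = max(range(len(scores)), key=lambda i: scores[i])
    return id2label[best]


def classify(text: str, model_id: str = MODEL_ID) -> str:
    """Classify one command string.

    Assumes a checkpoint loadable with the transformers Auto classes.
    """
    # Imported lazily so top_label works without torch/transformers installed.
    import torch
    from transformers import AutoModelForSequenceClassification, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForSequenceClassification.from_pretrained(model_id)
    model.eval()

    # Tokenize the command and run a single forward pass without gradients.
    inputs = tokenizer(text, return_tensors="pt", truncation=True)
    with torch.no_grad():
        logits = model(**inputs).logits[0]

    # Map the highest-scoring class index to its label name.
    return top_label(logits.tolist(), model.config.id2label)
```

With a real checkpoint path in place of MODEL_ID, classify("Turn left") would be expected to return LEFT, as in the example above. The label set beyond LEFT is not stated here, so it is read from the checkpoint's own id2label mapping rather than hard-coded.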