Finetuned ViT model on the Food dataset to identify class of ['steak', 'pizza','sushi']