--- license: cc-by-4.0 metrics: - accuracy --- # Image Scenery Classification This model is built on the efficientnet_b2 architecture. The model uses pretrained weights of the model found in the torchvision.models library. The classification head was changed, to be a dropout layer, followed by a linear layer with 6 target classes. Using transfer learning, the model was then trained on the [intel image dataset.](https://www.kaggle.com/datasets/puneet6060/intel-image-classification) See the corresponding [hugging face space](https://huggingface.co/spaces/richardschattner/scenery_classification) for a live demo of the model. #### Performance The model achieved a test accuracy of 89,67%. Misclassified images are often times ambiguous, such as a snowy mountain, being misclassified as 'glacier'. The model architecture is quite simple, when compared to SOTA architectures and produces fast predictions. A prediction on the hugging face space, hosted on the free cpu, takes about 0.2 seconds. The code is original and written by me.