--- license: apache-2.0 metrics: - accuracy - f1 base_model: - google/vit-base-patch16-224-in21k --- Returns hand gesture based on image with about 96% accuracy. See https://www.kaggle.com/code/dima806/hand-gestures-image-detection-vit for more details. ![image/png](https://cdn-uploads.huggingface.co/production/uploads/6449300e3adf50d864095b90/hGABRUvyao5roojmQY79K.png) ``` Classification report: precision recall f1-score support call 0.9256 0.9752 0.9498 11825 dislike 0.9784 0.9862 0.9823 11826 fist 0.9833 0.9870 0.9851 11826 four 0.9140 0.9357 0.9247 11826 like 0.9761 0.9101 0.9420 11825 mute 0.9831 0.9964 0.9897 11826 ok 0.9586 0.9658 0.9622 11825 one 0.9708 0.9453 0.9579 11826 palm 0.9764 0.9637 0.9700 11826 peace 0.9187 0.9367 0.9276 11825 peace_inverted 0.9784 0.9748 0.9766 11826 rock 0.9439 0.9361 0.9400 11825 stop 0.9502 0.9723 0.9611 11825 stop_inverted 0.9828 0.9546 0.9685 11826 three 0.9135 0.9068 0.9101 11826 three2 0.9799 0.9670 0.9734 11826 two_up 0.9570 0.9766 0.9667 11826 two_up_inverted 0.9754 0.9703 0.9729 11825 accuracy 0.9589 212861 macro avg 0.9592 0.9589 0.9589 212861 weighted avg 0.9592 0.9589 0.9589 212861 ```