vit-eGTZANplus / README.md

ghermoso

Update README.md

cd08901 verified over 1 year ago

preview code

raw

history blame contribute delete

982 Bytes

metadata

license: apache-2.0
base_model: google/vit-base-patch16-224-in21k
tags:
  - generated_from_trainer
metrics:
  - accuracy
model-index:
  - name: vit-eGTZANplus
    results: []
datasets:
  - ghermoso/egtzan_plus
pipeline_tag: image-classification

Vision Transformer (ViT) for Music Genre Classification

Model Overview

Model Name: ghermoso/vit-eGTZANplus
Task: Image Classification
Dataset: egtzan_plus
Model Architecture: Vision Transformer (ViT)
Finetuned from model: This model is a fine-tuned version of google/vit-base-patch16-224-in21k on an egtzan_plus dataset.

It achieves the following results on the evaluation set:

Loss: 0.8358
Accuracy: 0.7460