motorcycle-vit-model

Base Architecture: google/vit-base-patch16-224
Fine-tuning: Transfer learning on custom motorcycle dataset
Framework: Hugging Face transformers + PyTorch
Task: Image Classification (4 classes)

A Vision Transformer (ViT) fine-tuned to classify motorcycle images into four types: cruiser, sport, naked, and roller.
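A minimal inference sketch, assuming the fine-tuned weights are published on the Hugging Face Hub. The card does not give the full repository id, so `model_id` is left as a parameter; `classify` and `rank_predictions` are hypothetical helper names, and the heavy imports sit inside `classify` so the ranking helper can be used on its own.

```python
def rank_predictions(probs, id2label):
    """Pair each class probability with its label name, most likely first."""
    pairs = [(id2label[i], float(p)) for i, p in enumerate(probs)]
    return sorted(pairs, key=lambda pair: pair[1], reverse=True)


def classify(image_path, model_id):
    """Return (label, probability) pairs for one image, best guess first.

    model_id is whatever Hub id or local directory holds the fine-tuned
    checkpoint; it is an assumption, not something stated in this card.
    """
    # Imported here so rank_predictions works without torch/transformers.
    import torch
    from PIL import Image
    from transformers import AutoImageProcessor, AutoModelForImageClassification

    processor = AutoImageProcessor.from_pretrained(model_id)
    model = AutoModelForImageClassification.from_pretrained(model_id)
    model.eval()

    # The processor resizes and normalizes to the 224x224 input ViT expects.
    image = Image.open(image_path).convert("RGB")
    inputs = processor(images=image, return_tensors="pt")

    with torch.no_grad():
        logits = model(**inputs).logits  # shape: (1, num_classes)

    probs = logits.softmax(dim=-1)[0].tolist()
    return rank_predictions(probs, model.config.id2label)
```

Calling `classify("bike.jpg", model_id)` with the real repository id would return the four labels sorted by predicted probability.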

Model Details

Label	Description
cruiser	Low seat height, forward foot pegs, relaxed riding position
sport	Full fairings, aggressive aerodynamic design
naked	Minimal fairings, exposed engine, upright seating
roller	Scooters with step-through frames and smaller wheels
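The four labels above could be attached to the base checkpoint roughly as follows. This is a sketch of a typical transfer-learning setup, not the author's actual training script: the label order, `build_label_maps`, and `load_base_for_finetuning` are assumptions made for illustration.

```python
# Label order is an assumption; the card lists the classes but not their ids.
LABELS = ["cruiser", "sport", "naked", "roller"]


def build_label_maps(labels):
    """Build the id -> name and name -> id mappings stored in the model config."""
    id2label = {i: name for i, name in enumerate(labels)}
    label2id = {name: i for i, name in id2label.items()}
    return id2label, label2id


def load_base_for_finetuning(labels=LABELS):
    """Load google/vit-base-patch16-224 with a fresh classification head."""
    # Imported here so build_label_maps works without transformers installed.
    from transformers import ViTForImageClassification

    id2label, label2id = build_label_maps(labels)
    return ViTForImageClassification.from_pretrained(
        "google/vit-base-patch16-224",
        num_labels=len(labels),
        id2label=id2label,
        label2id=label2id,
        # The base checkpoint ships a 1000-class ImageNet head; this flag
        # lets the loader replace it with a randomly initialized 4-class one.
        ignore_mismatched_sizes=True,
    )
```

Storing the mappings in the config means the saved model reports label names rather than bare class indices at inference time.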

Metric	Value
Validation Accuracy	70.59%
Validation Loss	1.116
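To put the loss figure in context, it can be compared with what a uniform guess over four classes would score. The sketch below assumes the reported value is a mean cross-entropy in nats (the PyTorch default); the card does not state this explicitly.

```python
import math

NUM_CLASSES = 4
VAL_LOSS = 1.116  # validation loss reported in the table above

# Cross-entropy of a model that assigns probability 1/4 to every class.
chance_loss = math.log(NUM_CLASSES)

# exp(-loss) is the geometric-mean probability placed on the true class.
mean_true_class_prob = math.exp(-VAL_LOSS)

print(f"chance-level loss: {chance_loss:.3f}")
print(f"reported loss:     {VAL_LOSS:.3f}")
print(f"geometric-mean true-class probability: {mean_true_class_prob:.3f}")
```

Under that assumption, the reported loss sits below the chance level of ln 4 ≈ 1.386, which corresponds to a geometric-mean true-class probability of roughly 0.33 versus 0.25 for uniform guessing.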

Model size: 85.8M params
Tensor type: F32
Format: Safetensors
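The parameter count and tensor type together give a rough estimate of the weight file size. This is back-of-the-envelope arithmetic only: it ignores the small Safetensors header and uses the rounded parameter count from the card.

```python
PARAMS = 85.8e6          # parameter count listed on the card (rounded)
BYTES_PER_PARAM = 4      # F32 stores each value in 4 bytes

size_bytes = PARAMS * BYTES_PER_PARAM
size_mb = size_bytes / 1e6        # decimal megabytes
size_mib = size_bytes / 2**20     # binary mebibytes

print(f"approx. weight size: {size_mb:.1f} MB ({size_mib:.1f} MiB)")
```

That works out to roughly 343 MB of weights; loading in half precision would about halve the memory footprint.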