frgfm/imagenette
Viewer • Updated • 40.2k • 5.99k • 20
How to use Misupatel/vit-imagenette with Transformers:
# Use a pipeline as a high-level helper
from transformers import pipeline
pipe = pipeline("image-classification", model="Misupatel/vit-imagenette")
pipe("https://huggingface.co/datasets/huggingface/documentation-images/resolve/main/hub/parrots.png") # Load model directly
from transformers import AutoImageProcessor, AutoModelForImageClassification
processor = AutoImageProcessor.from_pretrained("Misupatel/vit-imagenette")
model = AutoModelForImageClassification.from_pretrained("Misupatel/vit-imagenette")Fine-tuned from google/vit-base-patch16-224
on the Imagenette 160 px dataset.
| Metric | Value |
|---|---|
| Val accuracy | 99.52% |
| Val loss | 0.017721 |
| Best epoch | 9 |
| Classes | 10 |
tench, English springer, cassette player, chain saw, church, French horn, garbage truck, gas pump, golf ball, parachute
from transformers import AutoModelForImageClassification, ViTImageProcessor
from PIL import Image
import torch
model = AutoModelForImageClassification.from_pretrained("Misupatel/vit-imagenette")
processor = ViTImageProcessor.from_pretrained("Misupatel/vit-imagenette")
model.eval()
image = Image.open("your_image.jpg").convert("RGB")
inputs = processor(images=image, return_tensors="pt")
with torch.no_grad():
logits = model(**inputs).logits
predicted_class = model.config.id2label[logits.argmax(-1).item()]
print(predicted_class)
Base model
google/vit-base-patch16-224