--- language: en license: apache-2.0 pipeline_tag: image-classification tags: - computer-vision - image-classification - mobilenet-v2 - cifar100 - whirlwindai datasets: - cifar100 metrics: - accuracy ---

---

# Vision, Simplified. Small models can recognize more than their size suggests. GVM explores efficient computer vision using lightweight architectures, fast inference, and practical deployment. Designed to run almost anywhere.

--- # Classification Performance

| Epoch | Training Loss | Validation Accuracy | |:------:|:-------------:|:------------------:| | **1** | 3.36 | **41.75%** | | **2** | 2.78 | **47.14%** | | **3** | 2.64 | **47.40%** |

--- # Quick Start ```python import torch import torchvision.transforms as transforms import timm import requests import json from PIL import Image config = json.loads( requests.get( "https://huggingface.co/WhirlwindAI/GVM/resolve/main/config.json" ).text ) model = timm.create_model( "mobilenetv2_100", pretrained=False, num_classes=config["num_classes"] ) state = torch.hub.load_state_dict_from_url( "https://huggingface.co/WhirlwindAI/GVM/resolve/main/model.pth", map_location="cpu" ) model.load_state_dict(state) model.eval() transform = transforms.Compose([ transforms.Resize(256), transforms.CenterCrop(224), transforms.ToTensor(), transforms.Normalize( mean=[0.485,0.456,0.406], std=[0.229,0.224,0.225] ) ]) image = Image.open("image.jpg").convert("RGB") tensor = transform(image).unsqueeze(0) prediction = model(tensor).argmax(1).item() print(config["class_names"][prediction]) ``` --- # Highlights

| | | |:---:|:---| | **Architecture** | MobileNetV2 | | **Dataset** | CIFAR-100 | | **Classes** | 100 | | **Model Size** | 14 MB | | **Framework** | PyTorch | | **Inference** | CPU & GPU Friendly |

--- # Repository Contents ``` model.pth config.json README.md ``` --- # Current Limitations - Trained for only **3 epochs** - Frozen backbone during training - CIFAR-100 is considerably harder than CIFAR-10 - Intended as an efficient baseline rather than a state-of-the-art classifier --- # Roadmap - Higher resolution training - Full backbone fine-tuning - Improved augmentation - ONNX export - TensorRT support - Interactive demo ---