LH-Tech-AI
/

GyroScope

+---
+license: apache-2.0
+tags:
+  - image-classification
+  - rotation-prediction
+  - resnet
+  - pytorch
+  - vision
+datasets:
+  - ILSVRC/imagenet-1k
+pipeline_tag: image-classification
+library_name: transformers
+---
+# 🔄 GyroScope — Image Rotation Prediction
+**GyroScope** is a ResNet-18 trained **from scratch** to detect whether an image is rotated by **0°, 90°, 180°, or 270°** — and correct it automatically.
+> Is that photo upside down? Let GyroScope figure it out.
+---
+## 🎯 Task
+Given any image, GyroScope classifies its orientation into one of **4 classes**:
+| Label | Meaning | Correction |
+|-------|---------|------------|
+| 0 | 0° — upright ✅ | None |
+| 1 | 90° CCW | Rotate 270° CCW |
+| 2 | 180° — upside down | Rotate 180° |
+| 3 | 270° CCW (= 90° CW) | Rotate 90° CCW |
+**Correction formula:** `correction = (360 − detected_angle) % 360`
+---
+## 📊 Benchmarks
+Trained on **50,000 images** from [ImageNet-1k](https://huggingface.co/datasets/ILSVRC/imagenet-1k) × 4 rotations = **200k training samples**.
+Validated on **5,000 images** × 4 rotations = **20k validation samples**.
+| Metric | Value |
+|--------|-------|
+| **Overall Val Accuracy** | **XX.XX%** |
+| Per-class: 0° (upright) | XX.X% |
+| Per-class: 90° CCW | XX.X% |
+| Per-class: 180° | XX.X% |
+| Per-class: 270° CCW | XX.X% |
+| Training Epochs | 12 |
+| Training Time | ~4h (Kaggle T4 GPU) |
+<!-- UPDATE the table above with your final results -->
+### Training Curve
+| Epoch | Train Acc | Val Acc |
+|-------|----------|---------|
+| 1 | 41.4% | 43.2% |
+| 2 | 52.0% | 46.9% |
+| 3 | 59.4% | 62.8% |
+| 4 | 64.1% | 66.0% |
+| 5 | 67.8% | 69.48% |
+| 6 | — | — |
+| 7 | — | — |
+| 8 | — | — |
+| 9 | — | — |
+| 10 | — | — |
+| 11 | — | — |
+| 12 | — | — |
+<!-- UPDATE with final epoch results -->
+---
+## 🏗️ Architecture
+| Detail | Value |
+|--------|-------|
+| Base | ResNet-18 (from scratch, **no pretrained weights**) |
+| Parameters | 11.2M |
+| Input | 224 × 224 RGB |
+| Output | 4 classes (0°, 90°, 180°, 270°) |
+| Framework | 🤗 Hugging Face Transformers (`ResNetForImageClassification`) |
+### Training Details
+- **Optimizer:** AdamW (lr=1e-3, weight_decay=0.05)
+- **Scheduler:** Cosine annealing with 1-epoch linear warmup
+- **Loss:** CrossEntropy with label smoothing (0.1)
+- **Augmentations:** RandomCrop, ColorJitter, RandomGrayscale, RandomErasing
+- **⚠️ No flips** — horizontal/vertical flips would corrupt rotation labels
+- **Mixed precision:** FP16 via `torch.cuda.amp`
+---
+## 🚀 Quick Start
+### Installation
+```bash
+pip install transformers torch torchvision pillow requests
+```
+### Inference — Single Image from URL
+```bash
+python3 use.py
+```
+## 💡 Example
+Input (rotated 180°):
+![cat image, rotated to the left by 90°](https://cdn-uploads.huggingface.co/production/uploads/697f2832c2c5e4daa93cece7/My19YzxFOYQkftVHw1rPr.png)
+GyroScope Output:
+<!--OUTPUT HERE-->
+<br>
+Corrected:
+![cat image, now correctly rotated](https://cdn-uploads.huggingface.co/production/uploads/697f2832c2c5e4daa93cece7/QhpbxPuwhyMj19N7v34iZ.png)
+## ⚠️ Limitations
+- Rotationally symmetric images (balls, textures, patterns) are inherently ambiguous — no model can reliably classify these.
+- Trained on natural images (ImageNet). Performance may degrade on:
+- Documents / text-heavy images
+- Medical imaging
+- Satellite / aerial imagery
+- Abstract art
+Only handles 90° increments — arbitrary angles (e.g. 45° or 135°) are **not supported**!
+Trained from scratch on 50k images — a pretrained backbone would likely yield higher accuracy (Finetuning).
+## 📝 Use Cases
+- 📸 Photo management — auto-correct phone/camera orientation
+- 🗂️ Data preprocessing — fix rotated images in scraped datasets
+- 🤖 ML pipelines — orientation normalization before feeding to downstream models
+- 🖼️ Digital archives — batch-correct scanned/uploaded images
+> Yesterday, I was sorting photos and like every photo was rotated wrong! This inspired me to make this tool 😂
+## 📜 License
+Apache 2.0
+## 🙏 Acknowledgments
+- Dataset: ILSVRC/ImageNet-1k
+- Architecture: Microsoft ResNet via 🤗 Transformers
+- Trained on Kaggle (Tesla T4 GPU)
+---
+>  GyroScope — because every image deserves to stand upright.